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Preface 


Tn  1951  the  Section  of  Biological  Chemistry  of  the  International  Union  of 
Pure  and  Applied  Chemistry  (lUPAC)  established  a  Protein  Commission 
largely  for  the  purpose  of  investigating  the  possibility  of  making  available  on 
an  international  basis  a  few  protein  preparations  meeting  reasonable  cri- 
teria of  homogeneity.  This  task  was  found  to  be  more  difficult  than  was 
foreseen,  but  early  in  1957  an  insulin  preparation,  made  by  the  British 
Insulin  Manufacturers,  and  examined  by  countercurrent  and  chromato- 
graphic methods,  was  made  available.  During  discussion  on  a  possible 
extension  of  this  programme,  it  became  apparent  that  the  work  of  the 
Commission  would  be  greatly  helped  and  the  value  of  any  future  plans  be 
better  assessed,  if  the  question  of  homogeneity  or  'purity'  of  proteins  could 
be  considered  by  a  special  symposium  at  which  many  of  the  leading  protein 
chemists  would  be  present.  Financial  support  by  lUPAC  was  promised  in 
the  most  generous  manner  and  it  was  decided  to  hold  such  a  conference 
in  Paris  in  the  summer  of  1957.  As  the  planning  of  this  symposium  pro- 
gressed it  became  clear  that  it  was  not  advisable  to  restrict  the  subjects  to 
be  discussed  too  closely  to  questions  of  immediate  relevance  to  the  problem 
of  homogeneity  and  in  the  end  the  programme  of  the  conference  was 
arranged  to  give  an  all-round  survey  of  protein  chemistry  at  the  present 
time. 

The  lUPAC  Symposium  on  Protein  Structure  took  place  in  Paris  from 
25  to  29  July  1957  under  the  general  chairmanship  of  Professor  Jean  Roche. 
The  Symposium  was  attended  by  over  150  protein  chemists,  but  it  was  felt 
that  a  much  larger  number  of  people  might  like  to  see  in  print  the  main 
papers  given  in  Paris.  These  are  now  collected  in  the  present  volume,  ampli- 
fied by  the  contributions  of  many  speakers  in  the  Discussion.  Unfortunately 
it  has  not  been  possible  to  include  all  the  comments  which  were  made  and 
only  those  which  gave  new  experimental  facts  directly  relevant  to  the  de- 
fined topics  of  the  Conference  are  presented.  In  editing  this  book  contain- 
ing papers  written  in  three  different  languages,  a  considerable  degree  of 
latitude  has  been  adopted.  The  system  of  quoting  literature  references  has 
been  allowed  to  vary  greatly,  and  both  EngHsh  and  American  spelling  has 
been  permitted. 

We  would  like  to  thank  all  those  who  have  helped  to  make  the  Paris 
symposium  a  success  and  the  publication  of  this  volume  possible.  Professor 
Murray  Luck,  the  President  of  the  Section  of  Biological  Chemistry,  and 
other  Honorary  Officers  of  lUPAC  have  given  helpful  support  at  all  stages. 
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Professor  Roche,  Vice-President  of  the  Commission,  was  responsible  for  the 
local  arrangements  and  organized  the  Symposium  in  a  most  competent 
manner  with  unfailing  courtesy.  Dr  Stanford  Moore  deserves  the  main 
credit  for  the  efficient  way  in  which  the  scientific  side  of  the  meeting  was 
planned.  We  are  also  indebted  to  Dr  Green  for  his  help  with  the  proof 
reading  and  for  providing  the  index.  Finally,  I  wish  to  express  our  apprecia- 
tion to  the  publishers  for  their  helpful  co-operation  in  bringing  out  this 
volume  so  quickly  and  so  efficiently. 

ALBERT  NEUBERGER 
LONDON 

May  1958 


Claude  Fromageot 

1899-1958 

It  is  with  profound  regret  that  we  have  to  record  the  untimely  death  on 
January  10th,  1958  of  Professor  Claude  Fromageot,  whose  contribution  to 
the  Protein  Symposium  appears  in  this  volume.  Fromageot  was  Secretary 
of  the  Biological  Section  of  I.U.P.A.C.,  and  with  Professor  Jean  Roche 
contributed  a  great  deal  to  the  success  of  the  Symposium.  By  his  death, 
France  has  lost  one  of  her  leading  biochemists,  who  was  known  internation- 
ally for  his  distinguished  work  in  many  fields,  and  whose  eminence  was 
achieved  by  the  most  selfless  and  disciplined  devotion  to  his  researches. 
These  are  notable  not  only  for  their  quality  but  also  for  their  unusual  variety, 
ranging  from  structural  aspects  of  protein  chemistry — exemplified  by  papers 
on  vasopressin,  ovomucoid  and  lysozyme — to  pioneer  work  on  sulphur 
raetaboUsm  and  the  stability  of  enzymes.  In  addition  to  the  responsibiHties 
incurred  by  his  duties  as  Professor  of  Biological  Chemistry  in  the  Faculty  of 
Science  at  the  Sorbonne  and  the  guidance  of  several  teams  working  under 
him,  he  found  time  to  contribute  most  valuable  review  articles,  of  which 
perhaps  that  with  Bricas  on  'Naturally  occurring  peptides'  is  outstanding. 

The  loss  to  biochemistry  is  great,  but  even  greater  to  those  who  knew  him 
is  the  loss  of  a  colleague  whose  integrity  was  a  shining  example  to  all. 
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Factors  affecting  the  structure  of  hemoglobins 
and  other  proteins 


LINUS  PAULING 

and  Crellin  Laboratories  of  Chen 
a  Institute  of  Technology,  Pasader 

(Communication  No.  2215) 


Gates  and  Crellin  Laboratories  of  Chemistry 
California  Institute  of  Technology,  Pasadena,  Caüf. 


An  understanding  of  the  structural  basis  of  the  difference  in  properties  of 
normal  adult  human  hemoglobin  and  the  abnormal  hemoglobins,  such 
as  sickle-cell-anemia  hemoglobin,  will  be  obtained  only  when  complete 
structure  determinations  of  the  molecules  of  these  substances  have  been 
made.  The  properties  of  hemoglobin  depend  not  only  on  the  sequence  of 
amino-acid  residues  in  the  polypeptide  chains  but  also  on  the  way  in  which 
the  polypeptide  chains  are  folded. 

It  is  probable  that  the  folding  of  the  polypeptide  chains  of  a  protein  is 
in  considerable  part,  perhaps  in  some  cases  entirely,  determined  by  the 
amino-acid  sequence  in  the  chains.  For  example,  it  has  been  shown  by 
Doty,  Moffitt,  and  their  co-workers^ -^-^  that  the  chains  of  poly-a-L- 
glutamic  acid  and  of  poly-y-benzyl-L-glutamate  have  the  configuration,  in 
solution  in  certain  solvents,  of  the  right-handed  a-helix,  and  that  the  mole- 
cules resume  this  right-handed  helical  configuration  after  they  have  been 
unfolded  by  change  in  the  nature  of  the  solvent,  and  then  have  been  re- 
turned to  the  original  solvent  in  which  the  a-helix  is  stable.  It  is  evident 
that  the  interactions  of  the  side  chains  of  the  L-amino-acid  residues  stabilize 
the  right-handed  a-helix  relative  to  the  left-handed  a-helix  for  these  poly- 
peptides. Yang  and  Doty^  have  also  reported  that  the  molecules  of  certain 
proteins — especially  silk  fibroin  and  the  B  chain  of  insulin — can,  in  certain 
solvents,  be  brought  into  the  configuration  of  the  right-handed  a-hehx. 
These  assignments  of  sense  to  the  a-helix,  made  by  application  of  the  theory 
of  optical  activity  developed  by  Moffitt,^  are  probably  more  reliable  than 
the  earher  assignments  made  by  Riley  and  Arndt.  ^ 

There  is  a  possibihty  that  the  difference  in  properties  of  the  abnormal 
hemoglobins  and  normal  adult  human  hemoglobin  can  be  ascribed  en- 
tirely to  the  replacement  of  a  few  amino  acid  residues  by  residues  of  other 

Bps 
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amino  acids.  Such  a  change  in  even  two  or  three  residues  at  the  surface  of 
the  molecule  might  introduce  or  destroy  the  self-complementariness  in  sur- 
face configuration  to  which  is  attributed  the  low  solubility  of  sickle-cell- 
anemia  hemoglobin,  and  the  sickling  of  the  cells  consequent  upon  the 
formation  of  tactoids.  There  is,  however,  some  evidence  that  a  change  in 
configuration  of  the  polypeptide  chains  is  also  involved.  The  principal  evid- 
ence indicating  this  change  in  configuration  is  the  observed  heterogeneity 
of  globin  made  from  normal  adult  human  hemoglobin  and  a  similar  hetero- 
geneity of  the  globin  from  sickle-cell-anemia  hemoglobin,  as  illustrated 
by  the  work  of  Havinga  and  Itano.'  Havinga  and  Itano  reported  that  care- 
fully prepared  globins  of  the  two  kinds  show  a  difference  in  electrophoretic 
mobility  similar  to  that  of  the  hemoglobins,  but  that  slight  denaturation 
causes  the  two  globin  preparations  to  become  heterogeneous,  the  hetero- 
geneity being  of  such  a  nature  as  to  suggest  that  there  are  two  ways  of 
folding  the  polypeptide  chains,  one  characteristic  of  normal  hemoglobin 
and  the  other  of  sickle-cell-anemia  hemoglobin.  It  is  possible,  of  course, 
that  even  a  change  in  a  couple  of  amino  acid  residues  could  effect  a  change 
in  the  way  of  folding  the  chains  in  one  part  of  the  molecule. 

The  difficulties  of  experimental  determination  of  the  complete  molecular 
structure  of  globular  proteins  are  so  great  as  to  make  it  essential  that  every 
possible  aid  be  utilized.  In  the  attacks  on  the  problem  of  protein  structure 
one  of  the  most  valuable  aids  has  been  found  to  be  the  formulation  of  struc- 
tural principles  for  polypeptide  chains.  There  is  now  no  doubt  that  stable 
configurations  of  polypeptide  chains  involve  the  planar  configuration  of  the 
amide  group,  with  distances  aC— C'-l-53  A,  C— 0-1-24  A,  C— N  = 
1-32  A,  N— aC-147  A  (all  ±0-01  A),  and  angles  N— aC— C'  =  110°, 
aC— C— N  =  114°,  O— C— N  =  125°,  and  C— N— aC  =  123°.  The  planarity 
of  the  amide  group  and  the  shortening  of  the  C — N  bond  to  1-32  A  are 

H  H 

\       ../  \  / 

attributed  to  resonance  between  structures       C — N        and        C=N       , 

^  \  /  \ 

O:  :0: 

with  about  60%  and  40%  contributions  respectively,  corresponding  to  40% 
double-bond  character  for  the  C — N  bond.  In  addition,  the  stable  con- 
figurations of  polypeptide  chains  involve  the  formation  of  N — H'--0  bonds, 
with  the  oxygen  atom  close  to  the  N — H  axis  and  the  nitrogen-oxygen  dis- 
tance equal  to  1-79  ±0-10  A.« 

The  trans  configuration  of  the  amide  group  seems  to  be  significantly 
more  stable  than  the  eis  configuration.  At  the  present  time  the  only  known 
peptide  structure  in  which  the  amide  groups  have  the  eis  configuration  is 
that  of  diketopiperazine,^  in  which  the  nature  of  the  covalent  bonds  (a  six- 
membered  ring,  for  the  cyclic  dipeptide)  is  such  as  to  require  this  configura- 
tion. Spectroscopic  evidence  showing  that  the  trans  configuration  in  simple 
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amides  is  significantly  more  stable  than  the  eis  configuration  has  been  sum- 
marized by  Mizushima.^"  From  infrared  absorption  studies  of  amides  and 
TV-substituted  amides  Badger  and  Rubalcava^^  concluded  that  the  trans  con- 
figuration may  well  be  stabilized  by  more  than  2  kcal/mole. 

The  principal  degrees  of  freedom  in  the  polypeptide  chain  are  the  azi- 
muthal  orientations  around  the  N — aC  single  bond  and  the  aC — C  single 
bond.  It  was  suggested  a  few  years  ago  by  Pauling  and  Corey^^  ^jj^t  six 
values  of  the  azimuthal  angle  for  each  of  these  single  bonds  are  more  stable 
than  other  values.  Some  doubt  has  been  thrown  on  this  conclusion  by  the 
observation  that  the  amplitude  of  the  Fourier  term  with  six  minima  for 
rotation  around  the  C — N  single  bond  in  nitromethane  is  extremely  small, 
only  about  0-003  kcal/mole.  At  the  present  time  it  seems  to  be  wise  to  allow 
all  azimuthal  orientations  about  the  single  bonds  in  the  attempted  formula- 
tion of  configurations  of  polypeptide  chains. 

There  exists  now  evidence  indicating  that  an  additional  structural  feature 
for  polypeptide  chains  can  be  accepted.  In  their  recent  review  of  the  configura- 
tion of  polypeptide  chains  in  proteins  Pauling  and  Corey^  stated  their  opinion 
at  that  time  of  the  answer  to  the  question  of  the  orientation  of  the  N — H--0 
hydrogen-bond  axis  relative  to  the  O — C  axis  in  the  following  words: 
'Although  it  might  be  anticipated  from  consideration  of  the  partial  covalent 
character  of  the  H---0  interaction  that  the  favored  orientation  of  the 
N — H---0  hydrogen  bond  about  an  oxygen  atom  of  an  amide  group  would 
be  in  the  plane  of  the  amide  group  and  at  an  angle  of  about  125°  with  the 
C — O  bond,  there  is  little  evidence  that  this  orientation  is  at  all  favored  with 
respect  to  others,  and  at  the  present  time  it  may  be  assumed  that  the  oxygen 
atom  may  form  a  hydrogen  bond  in  any  direction.' 

There  exists,  however,  significant  indication  that  the  stable  angle  between 
the  hydrogen-bond  axis  and  the  O — C  axis  in  polypeptides  is  not  about 
125°,  but  is  close  to  a  straight  angle.  The  evidence  supporting  this  structural 
feature,  an  angle  of  180°  at  the  oxygen  atom,  is  summarized  in  the  following 
paragraphs. 

First,  all  of  the  known  (reasonably  well  substantiated)  structures  for  poly- 
peptide chains  in  nature,  including  synthetic  polypeptides,  have  the  angle 
approximately  180°  for  the  hydrogen  bond  at  the  oxygen  atom.  These  struc- 
tures include  the  a-helix,^^  the  antiparallel-chain  pleated  sheet,^^  the  parallel- 
chain  pleated  sheet,^^  the  structure  of  polyglycine  11,^^  and  the  several 
three-chain  structures  of  collagen  similar  to  those  suggested  by  Crick  and 
Rich.16 

In  addition,  there  are  several  simple  peptides  and  closely  related  sub- 
stances in  which  this  angle  is  nearly  180°.  Among  these  structures  are 
jS-glycylglycine,^''  A^,7V'-diacetylhexamethylenediamine,^^  glycyl-L-tyrosine 
hydrochloride  monohydrate, ^^  and  creatinine;^"  in  all  of  these  structures  the 
deviation  from  a  straight  angle  is  less  than  30°.  There  are  also  several  simple 
peptides  in  which  the  angle  at  the  oxygen  atom  is  as  small  as  120°  to  140° 
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(A^-acetylglycine,2^  A'^jA'^'-diglycyl-L-cysteine  dihydrate,^^  glycyl-L-tryptophan 
hydrate^^),  and  even  some  in  which  the  N — H--0  hydrogen  bond  involving 
the  NH  and  O  of  the  peptide  groups  is  not  formed,  as  for  one  of  the  peptide 
oxygens  in  glycyl-L-asparagine.^*  It  should  not  be  surprising  that  the  mole- 
cules of  a  simple  substance,  in  seeking  the  way  of  arranging  themselves  in 
a  crystal  that  minimizes  the  free  energy  of  the  crystal,  should  occasionally 
have  to  sacrifice  the  most  favorable  circumstances  for  hydrogen-bond  forma- 
tion in  order  to  satisfy  some  other  structural  features  leading  to  stabiUty. 

In  many  of  these  substances  the  N — H---0  hydrogen  bonds  that  are 
formed  by  oxygen  atoms  other  than  those  in  peptide  groups  involve  angles 
of  about  120°,  with  the  atoms  adjacent  to  the  oxygen  atom  coplanar.  Further 
examples  of  this  sort  are  the  carboxyl  oxygens  in  the  amino  acids  glycine,^^ 
alanine,^^  Lg-threonine,^'  and  DL-serine.^^ 

A  simple  theoretical  consideration  can  be  made  that  supports  the  pro- 
posed principle  that  under  ordinary  circumstances  an  oxygen  atom  attached 
to  a  carbon  atom  by  a  double  bond  tends  to  form  hydrogen  bonds  that  lie 
in  the  plane  formed  by  the  carbonyl  group  and  adjacent  atoms,  with  the 
angle  at  the  oxygen  atom  equal  approximately  to  120°,  whereas  in  hydrogen 
bonds  formed  by  a  peptide  group  the  angle  tends  to  be  a  straight  angle.  Let 
us  consider  the  electrons  of  the  oxygen  atom  in  a  carbonyl  group.  There 
are  two  electrons  involved  in  the  double  bond  (shared  with  two  electrons 
of  the  carbon  atom),  and  two  electron  pairs  occupy  the  remaining  two 
Orbitals  of  the  neon  shell  of  the  oxygen  atom.  We  may  discuss  the  deviation 
of  the  electronic  structure  from  spherical  symmetry  by  considering  only  the 
distribution  of  one  electron  in  each  of  the  two  orbitals  for  unshared  pairs 
inasmuch  as  there  is,  as  a  first  approximation,  spherical  symmetry  when 
one  electron  is  placed  in  each  of  the  four  orbitals  of  the  neon  shell.  Also, 
as  a  first  approximation,  we  may  treat  the  orbitals  as  tetrahedral  sp^  or- 
bitals. The  treatment  of  the  double  bond  as  involving  two  bent  tetrahedral 
bonds  is  mathematically  equivalent  to  its  treatment  as  a  sigma  bond  and 
a  pi  bond. 

When  the  orbitals  for  the  shared  pairs  are  taken  as  ^s-\ — j^Pz+^Px  and 

2      V2       ^ 

~s — 7=Pz-^^Px  (the  bond  direction  being  the  x  axis),  it  is  found  that  the 

2      V2        2 

electron  distribution  for  one  electron  in  each  of  these  orbitals,  with  arbitrary 
normalization,  is  given  by  the  expression 

p(,rz)  =  l  +  6  sin2  Öcos2  <^+3  cos^  ö+2\/3  cos  6  (1) 

When  this  is  differentiated  with  respect  to  the  angle  6  and  the  angle  (f>,  it 
is  found  that  there  are  two  maxima  in  the  density,  both  lying  in  the  plane 
determined  by  the  atoms  around  the  carbon  atom  and  at  the  angle  125°  16' 
with  the  C — O  axis. 
In  discussing  the  distribution  of  the  electrons  around  the  oxygen  atom 


1]  FACTORS  IN  PROTEIN  STRUCTURE  21 

of  the  amide  group,  as  in  a  peptide,  we  must  take  into  consideration  the 
contribution  of  structures  in  which  there  is  an  interaction  between  the  ionic 
character  of  the  sigma  bond  and  the  unshared  pair  of  the  oxygen  atom  lying 
in  the  plane  of  the  group.  This  causes  the  nature  of  the  carbon-oxygen  bond 
to  assume  to  some  extent  the  character  of  the  pi  bonds  in  carbon  dioxide. 
In  carbon  dioxide  we  may,  as  a  first  approximation,  describe  each  carbon- 
oxygen  bond  as  a  double  bond  to  which  the  pz  and  the  py  electrons  contribute 
equally  (there  is  also,  of  course,  some  contribution  of  the  triply  bonded  and 
singly  bonded  structures,  as  well).  The  contribution  for  the  bond  in  which 
the  double  bond  has  50%  ttz  character  and  50%  Try  character,  normahzed 
to  the  same  basis  as  for  Equation  1,  is  equal  to 

pQ7r2+^7rJ=4+2V3'cos  d  (2) 

It  is  found  by  differentiating  this  expression  that  the  maximum  is  at  0=0°; 
that  is,  along  the  carbon-oxygen  axis. 

In  the  amide  group  the  ionic  structure  for  the  sigma  bond  between  the 
carbonyl  carbon  and  the  nitrogen  atom  would  effectively  liberate  a  p  orbital 
for  carbon  lying  in  the  plane  of  the  group,  and  permit  a  double  bond  to  be 
formed  with  use  of  the  py  orbital  of  the  oxygen  atom  and  the  pair  of  elec- 
trons written  as  an  unshared  pair  in  the  usual  structural  formulas.  The 
electron  distribution  corresponding  to  a  contribution  by  n%  of  this 
structure,  with  a  contribution  of  (100-«)%  of  the  structure  involving  the 
double  bond  orbitals  perpendicular  to  the  plane  of  the  group,  is  found  on 
differentiating  the  corresponding  expression  for  p  to  have  the  maximum  in 
electron  density  along  the  carbon-oxygen  axis  when  the  double  bond  has 
21%  or  more  -ny  character  and  79%  or  less  ttz  character.  It  seems 
not  unreasonable  to  expect  that  there  is  approximately  this  amount  of  Try 
character  in  the  double  bond  in  the  amide  group,  and  accordingly  that  the 
electron  distribution  for  the  oxygen  atom  in  the  amide  group  is  such  as  to 
favor  a  linear  arrangement  of  the  hydrogen  bond  at  this  atom,  whereas  in 
other  compounds  containing  carbon-oxygen  double  bonds  the  electron  dis- 
tribution has  its  maximum  in  the  plane  of  the  ring,  with  the  angle  approxi- 
mately equal  to  the  tetrahedral  angle. 

From  these  considerations  we  reach  the  conclusion  that  an  additional 
structural  principle  can  be  formulated  for  folded  polypeptide  chains  ;  namely, 
that  the  stable  configurations  will  tend  to  be  those  in  which  the  N — H--0 — C 
hydrogen  bond  is  not  only  linear  with  respect  to  the  three  atoms  N — H--0, 
but  also  with  respect  to  the  carbon  atom  adjacent  to  the  oxygen  atom,  with 
the  bond  angle  at  the  oxygen  atom  close  to  180°.  This  structural  principle 
may  be  of  value  in  the  search  for  additional  polypeptide-chain  structures. 
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1.     INTRODUCTION 

In  a  series  of  recent  papers  by  Hvidt,  Johansen,  Vaslow,  Berger,  and  my- 
seifi.2.3.4.5.6,7,8.9  ^  study  was  made  of  the  rate  of  deuterium  exchange 
between  ordinary  water  and  proteins  or  peptides  in  which  all  the  easily  ex- 
changeable hydrogen  atoms,  i.e.  those  bound  to  oxygen,  nitrogen,  and  sul- 
phur atoms,  had  been  replaced  by  deuterium  atoms.  The  conditions  under 
which  this  rate  of  exchange  was  studied  were  varied  within  rather  wide  limits, 
the  temperature  e.g.  was  varied  from  0°  to  40°C,  pH  from  2  to  8.  In  some 
cases  buffer,  in  others  denaturing  agents  like  urea  or  guanidinium  chloride 
were  added. 

The  results  may  be  summarized  as  follows  : 

1.  Neglecting  the  possibility  that  the  exchange  of  one  deuterium  atom 
may  influence  the  rate  of  exchange  of  another,  the  rate  curve  for  any  peptide 
or  protein  may  be  represented  by  a  sum  of  first  order  terms,  viz.  : 

n„-n-=^me-ß^';  'Em=n^  (1) 

where  n^  is  the  total  number  of  exchangeable  deuterium  atoms  per  peptide 
or  protein  molecule  and  m  the  number  of  atoms  within  any  group  charac- 
terized by  the  same  rate  constant  ßt.  The  quantity  n  is  the  average  number 
of  exchanged  deuterium  atoms  at  the  time  t.  The  half-time  of  the  rate  group 
/  is  given  by 

ln2    0-6932  .^^ 


('.>' 


ßi  ßi 


As  everybody  knows,  most  kinetic  data  (and  ours  too)  can  be  represented 
by  such  exponential  series  and  the  number  of  constants  ßt  required  is  the 
smaller,  the  less  accurate  the  experiments. 

2.  The  deuterium  atoms  bound  to  oxygen  and  nitrogen  in  the  end  groups 
or  side  chains  of  peptides  and  proteins  seem  to  exchange  fast  in  most  cases, 
viz.  with  a  half-time  of  less  than  0-5  min.  at  pH  3  and  0",  i.e.  with  a  velocity 
which  is  too  high  to  be  picked  up  by  our  methods,  a  point  to  be  discussed 
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later.  In  a  few  cases,  where  the  side  chains  are  engaged  in  hydrogen  bond- 
ing, lower  reaction  rates  might  be  observed,  but  we  have  been  unable  to 
distinguish  such  cases  from  those  discussed  in  the  following  paragraph. 

3.  The  deuterium  atoms  bound  to  nitrogen  in  the  backbone  of  the  peptide 
chain  show  a  highly  varying  rate  of  exchange.  Half-times  from  less  than 
0-5  min.  to  more  than  24  hours  have  been  observed  at  pH  3  and  0°.  Since 
the  fast  reactions  are  primarily  found  in  unfolded,  denatured,  proteins  and 
peptides,  whilst  the  slow  reactions  are  predominant  in  native  molecules,  it 
is  natural  to  ascribe  the  sluggishness  of  the  reaction  in  the  latter  case  to 
internal  hydrogen  bonding  in  the  secondary  structure  of  these  native  mole- 
cules, viz.  to  the  presence  of  groups  of  the  type 

I  I 

C=0-  -D— N  (i) 

I  I 

where  the  deuterium  atom  is  less  available  for  exchange.  It  is  to  be  expected 
that  the  stronger  the  hydrogen  bond  in  question,  the  slower  the  observed 
exchange,  i.e.  the  smaller  ßi. 

4.  The  rate  of  deuterium  exchange  is  strongly  dependent  upon  tempera- 
ture and  generally  the  more  so,  the  lower  the  rate. 

5.  The  rate  of  deuterium  exchange  varies  with  pH,  the  most  general 
feature  being  that  it  increases  with  increasing  pH  from  a  pH  about  3.  In 
certain  cases  this  may  be  due  to  a  change  in  the  stabiHty  of  the  hydrogen 
bonds  caused  by  ionization  or  desionization  of  the  protein  molecule  but 
in  the  best  investigated  case,  which  I  shall  discuss  in  a  moment,  the  deter- 
mining factor  seems  to  be  the  rate  of  the  exchange  process  proper  which  is 
strongly  catalysed  by  hydroxyl  ions. 

6.  Addition  of  denaturing  agents  generally  increases  the  rate  of  exchange 
in  agreement  with  the  fact  that  deuterium  atoms  released  from  internal 
hydrogen  bonds  of  the  secondary  structure  may  react  faster  with  the  solvent. 

The  rapidly  growing  literature  on  the  secondary  structure  of  proteins  and 
polypeptides  has  made  it  likely  that  the  a-helix  of  Pauling  and  Corey  is  an 
essential  element  of  this  structure^"-^^-^^-^^-^^'^^,  and  the  bonds  of  type 
(i)  may  therefore  in  many  cases  be  regarded  as  situated  in  a-helices.  One 
of  these  cases  is  represented  by  poly-DL-alanine  (PDLA)  which  in  its  soluble 
form,  according  to  Elliott, ^°-^^  shows  an  infrared  band  at  1662  cm~^,  con- 
sidered to  be  characteristic  of  the  internal  CO-HN-bond.  The  question 
whether  this  polypeptide  in  aqueous  solution  maintains  its  helical  configura- 
tion is  of  course  of  considerable  interest,  and  since  PDLA  contains  no  ex- 
changeable hydrogen  atoms  in  its  side  chains  and  furthermore  is  unassociated 
in  aqueous  solution,  it  appears  to  be  an  ideal  object  for  the  study  of  the 
stability  of  helical  hydrogen  bonds  by  means  of  the  deuterium  exchange 
method. 
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In  the  following  a  discussion  of  some  of  the  results  obtained  with  PDLA 
will  therefore  be  given. 

2.    THE    PDLA  PREPARATION 

The  preparation  of  PDLA  used  was  synthesized  in  Katchalski's  laboratory 
and  was  reported  to  contain  an  average  of  30  residues  per  molecule.  The 
material,  which  contained  pyridine,  was  purified  by  dissolution  in  dilute 
NaOH,  lyophilization  over  P2O5,  redissolution  in  an  equivalent  amount  of 
HCl,  and,  finally,  lyophilization  after  removal  of  a  small  quantity  of  undis- 
solved gel.  The  purified  PDLA  dissolved  in  water  to  a  clear  solution.  The 
pH  of  the  solution  was  2-8,  and  the  peptide  was  therefore  present  mainly 
as  a  positive  charged  ion. 

A  simple  PDLA  molecule  containing  30  residues  has  a  molecular  weight 
of  2151  and  a  nitrogen  content  of  19-53%.  The  ratio  of  a-amino  groups  to 
total  nitrogen  atoms  should  be  1 :  30.  The  present  preparation  when  dis- 
solved in  0-lN-KCl  at  22°C  gave  a  titration  curve  which  showed  the  presence 
of  one  amino  group  (pK  8-1)  per  36-7  nitrogen  atom,  a  figure  which  is 
compatible  with  the  assumed  average  chain  length,  if  end  groups  of  the  type 

H00CCH(CH3)NHC0NH— 

(see  Sela  and  Berger^^)  replace  a-amino  groups  in  about  20%  of  the  mole- 
cules. PDLA  molecules  with  such  end  groups  have  a  molecular  weight  of 
2266  and  a  nitrogen  content  of  19-16%,  and  the  average  nitrogen  content 
of  our  preparation  should  therefore  be  19-46%.  The  experimental  value 
19-0%  (corrected  for  NaCl  and  acid  content)  is  in  fair  agreement  with  the 
theoretical  one.  The  molar  concentration,  Cs,  of  PDLA  in  a  given  solution 
was  calculated  from  the  nitrogen  content,  and  the  number  of  exchangeable 
hydrogen  atoms  per  molecule  was  assumed  to  be  29  (backbone) +4  (end 
groups)  at  pH  2-8,  i.e.  a  total  of  33. 

Since  the  hydrogen  exchange  was  measured  in  0-1 M  citrate  buffers  which 
were  0-1m  with  respect  to  glycerol  (cf.  '),  the  sedimentation  constant  of 
PDLA  was  determined  at  pH  3  in  this  medium.  The  sedimentation  con- 
stant corrected  to  water  at  20°C  was  found  to  be  represented  by  the  equation 

^20. 10i3=0-4-0-07  x(%  PDLA) 

in  the  concentration  range  0-5  to  2%,  which  is  compatible  with  a  molecular 
weight  of  2000  (D^IOAQ-'',  F~0-74),  Hence  there  is  no  indication  of 
significant  association.  The  question  of  the  distribution  of  molecular  sizes 
in  our  preparation  is  unsettled  but,  since  the  boundary  spreading  in  the 
centrifuge  appeared  to  be  normal,  it  may  be  inferred  that  the  size  distribution 
is  within  rather  narrow  limits. 

3.    EXCHANGE    METHOD 
The  method  used  in  all  cases  was  Method  2  described  in  (4)  and  (7);  i.e. 
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PDLA  was  first  deuterated  completely  and  the  rate  of  back-exchange  with 
ordinary  water  was  followed. 

A  volume  of  150  /m1.  of  a  2%  PDLA  solution,  pH  2-8,  was  lyophilized 
and  dried  in  a  vacuum  over  P2O5.  Next  150  jul,  of  99-7%  DgO  was  added 
and  allowed  to  exchange  with  the  peptide  at  38-6°C  for  4-15  hr.  The  deu- 
terated peptide  was  freed  from  D2O  by  cryosublimation  and  drying  for  4  hr. 
at  68°C,*-'  after  which  time  150  [A.  of  01  m  citrate  buffer  of  the  desired 
pH  and  temperature  were  added,  and  samples  of  15  /nl.  were  taken  out  at 
intervals  for  measurement  of  the  deuterium  exchange  (cryosublimation  and 
determination  of  the  density  of  the  water  in  the  gradient  tube.^-*-') 

It  should  be  noted  that  back-exchange  is  measured  here  in  buffers  in 
order  to  fix  pH  in  the  otherwise  poorly  buffered  PDLA  solutions.  We  also 
call  attention  to  the  fact  that  the  solid,  the  dissolution  of  which  is  the  initial 
step  in  the  kinetic  experiment,  is  the  same  in  all  cases  (obtained  by  lyo- 
philization  of  2%  PDLA  solutions  at  pH  2-8),  and  that  consequently  pH 
is  determined  by  the  composition  of  the  buffers  added.  The  value  obtained 
for  the  total  exchange  should  therefore  be  the  same  in  all  experiments 
(«»=33). 

4.  RESULTS  OF  THE  EXCHANGE  EXPERIMENTS 
In  each  kinetic  experiment  the  total  exchange,  w„,  at  the  given  conditions 
was  determined  by  placing  from  one  to  two  15  jul.  samples  at  38-6°C  for 
4-15  hr.  and  determining  the  deuterium  content  of  the  water.  As  the  mean 
of  24  determinations  under  widely  different  conditions,  a  value  of  33-2  was 
found  with  a  standard  deviation  of  0-4,  a  result  which  is  in  good  agreement 
with  the  above  assumptions.  Nevertheless  the  «-values  of  a  given  kinetic 
run  were  corrected  by  multiplying  them  by  the  ratio  33/(«„)a;,  where  (n„)x 
is  the  value  for  n^  determined  in  this  particular  run.  A  rough  correction 
for  losses  during  lyophilization  and  incomplete  removal  of  D2O  is  obtained 
in  this  way. 

Figs.  1  and  2  show  the  kinetic  results.  The  curves  drawn  are  theoretical 
ones  and  will  be  explained  in  the  following  section. 

5.  THEORY 
It  is  obvious  that  the  curves  in  Figs.  1-2  may  be  satisfactorily  described  by 
Eq.  1  (see  (4))  if  the  number  and  values  of  ßi  can  be  adjusted  arbitrarily 
to  fit  the  curves.  The  variations  of  ßi  with  a  given  parameter  X  (pH)  may 
be  most  adequately  determined  by  the  variation  of  t  at  constant  extension 
of  the  reaction  (constant  n).  From  Eq.  (1)  we  get 

yS^t    _0.t 

/8lnt\   ^      "hX^       _     8ßil8X  ,. 

\8xln~       ^iSiC-^»'  ~         ßi  ^  ^ 

If  now  the  ßi  may  be  regarded  as  products  of  a  general  factor,  B,  which 
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Fig.  1 .  Deuterium  exchange  of  PDLA  at  0°C. 
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Fig.  2.  Deuterium  exchange  of  PDLA  at  0°C. 
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is  a  function  of  X,  and  an  individual  one,  ai,  which  is  not,  we  get  from 
Eq.  (3) 

'8lnt\   ^_8lnB 
8X/n         8X 
for 


(■ 


(4) 


ßi^Bai     and      3^=<^iFy 


(5) 


In  our  attempt  to  explain  the  rate  curves  at  0°C  and  different  pH  we  have 
considered  the  simple  reaction 

R  — C  R— C-O    -HOH  R— C  =  0-  -HOH 


:  +2H2O  ;z::: 


(6) 


Ri— N  Ri— ND-  -OHa+QH         Rj— NH-OHa+QD 

A  Ai  A2 

where  QH  is  a  hydrogen  donor  or  acceptor  (H3O,  HgO,  or  OH-)  present 
in  constant  concentration  at  constant  pH.  We  have  here  taken  the  primitive 
attitude  to  assume  that  the  D-atom  in  the  internal  hydrogen  bond  exchanges 
only  by  way  of  the  'hydrolysed'  form  (Ai),  the  instabihty  of  the  bond,  and 
therefore  the  'availability'  of  the  deuterium  atom,  being  measured  by  the 
ratio  of  the  concentrations  of  forms  Ai  and  A  at  equilibrium. 
Using  the  stationarity  principle,  we  obtain  the  rate  constant 


ß=UQ\i]  \\ 


Â:_i  ,  k^[QYl]\ 


ki 


ki     j 


(7) 


and  if  the  exchange  reaction  proper  (Aj >  A2)  is  considered  the  bottle- 
neck of  the  over-all  reaction,  Eq.  (7)  is  transformed  into 

ß^keXk^lQU] 


where 


1 


1 


1  + 


^_l       l+eàFrl^T 


(8) 
(9) 


denoting  by  AFr  the  free  energy  of  'hydrolysis'  of  the  hydrogen  bond  in  A. 
Assuming  now  that  the  a-helix  is  a  predominant  secondary  structure  of 
PDLA  in  aqueous  solution,  we  may  utilize  Eq.  (9)  in  the  following  way: 
The  complete  helix  of  our  PDLA  contains  a  maximum  of  26  internal  hydro- 
gen bonds  with  four  end-group  and  three  terminal  imide  hydrogen  atoms 
exposed  to  water.  Adopting  the  treatment  given  by  Schellman^'-^^-^^  and 
assuming  that  the  opening  of  any  hydrogen  bond  along  the  hehx  takes  place 
exclusively  through  an  unwinding  of  the  helix  from  the  ends,  we  obtain 

1  -y^^-i 


ßi 


l-yl^ 


.y.k.iQU] 


(10) 
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for  the  rate  constant  assigned  to  the  two  bonds  situated  i  places  from  the 
ends.  The  quantity  y  is  e'^^ri^'^  and  i  may  assume  values  from  1  to  13, 
the  two  halves  of  the  helix  being  considered  symmetrical  (/2i=2),  The  cal- 
culations leading  to  Eq.  (10)  are  rather  trivial  and  will  be  given  in  greater 
detail  elsewhere.  The  terms  involving  ßi  for  the  seven  deuterium  atoms 
which  are  not  hydrogen-bonded  were  assumed  to  be  negligible. 

It  will  appear  from  Eq.  (10)  that  the  more  positive  AVr,  the  more  the 
ßi  will  decrease  toward  the  middle  of  the  helix  (/=13).  It  was  therefore 
tempting  to  apply  Eq.  (10)  to  our  results  and  see  whether  reasonable  values 
for  A¥r  and  k^y^  [QH]  could  be  obtained  on  the  basis  of  these  very  simple 
assumptions.  It  became  immediately  clear  that  it  was  impossible  to  fit  this 
theory  to  the  experimental  data  in  an  acceptable  way.  In  order  to  satisfy 
the  condition  that  deuterium  atoms  fully  exposed  to  water  (7  in  our  case) 
should  exchange  with  a  half-time  too  small  to  be  measured,  A¥r  has  to  be 
given  a  high  positive  value  so  that  hydrogen  bonds  only  a  few  places  away 
from  the  ends  are  suitably  inhibited  in  their  exchange.  This  choice  of  zlFr 
caused,  however,  the  ßi  of  bonds  nearer  to  the  middle  of  the  helix  to  become 
too  small  so  that  the  calculated  rate  curves  fell  off  too  rapidly. 

This  negative  result  led  us  to  consider  the  possibiUty  of  breaks  in  the 
interior  of  the  helix  as  a  means  to  increase  the  value  of  the  internal  ßi, 
and  we  have  adopted  the  treatment  of  Schellman,^^  p.  236,  Eq.  (9),  putting 

(A)  breafc  =  Ä:2[QH]  .  3  .  g-^^öreafc/^^  (1 1) 

and  taking  into  account  that  a  particular  hydrogen  atom  becomes  exposed 
in  three  different  breaks  and  that  breaks  occur  not  only  in  completely  folded 
helices,  but  also  in  partly  unfolded  ones.  Formula  (11)  is  correct  only  if 
AY'r  is  large  so  that  the  concentration  of  intact  helices  is  high,  and  even  then 
it  is  a  crude  approximation.  A  more  exact  expression  could  be  derived,  but 
it  is  hardly  worth  the  effort  to  introduce  these  highly  complex  corrections. 

The  possibility  that  deuterium  ions  are  pulled  out  one  by  one  from  the 
middle  of  the  hehx  and  replaced  by  protium  ions  cannot  be  excluded  entirely. 
Since,  however,  the  mechanism  of  this  process  is  not  clear  to  us — except 
for  the  belief  that  it  may  lead  to  a  term  of  the  same  type  as  Eq.  (1 1),  i.e. 
to  ßi  values  which  are  approximately  independent  of  i — we  shall  postpone 
the  discussion  of  this  point  to  a  later  publication. 

The  curves  in  Figs.  1  and  2  are  calculated  on  the  basis  of  Eqs.  (11)  and 
(10)  combined,  viz., 

Bi={e-iàP,lRTj^2>.  e-^^6rea*/^^}Ä:2[QH]  (12) 

or 

i3ï={(0-376)»+000572}A:2[QH]  (13) 

neglecting  higher  powers  of  y  in  Eq.  (10).  The  numerical  value  introduced 
for  A¥r  was  +530  cal.  and  for  ^F&reaÄ;  =  +3400  cal.,  a  somewhat  lower 
value  than  that  adopted  by  Schellman  (4500),  who  neglected  the  entropy 
increase  in  the  process  of  breaking.  The  calculation  is  in  any  case  uncertain 
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considering  especially  the  additional  stabilization  of  the  helix  by  non-polar 
bonds  discussed  below.  In  fact  it  mainly  serves  to  show  the  necessity  of 
introducing  a  third  constant  besides  y  and  kz  in  the  description  of  the 
kinetic  curves.  It  will  be  observed  that  Eq.  (12)  is  of  the  form  Bxat  (Eq. 
(5)),  so  that  the  whole  set  of  curves  in  Figs.  1  and  2  is  determined  by  Eq.  (13) 
introducing  the  following  expression  for  ä:2[QH]  : 


^2[QH]  =  6-2. 10i«[OH-]+4-6. 102[OH3+]+0-72 


(14) 


It  appears  therefore  from  Figs,  1  and  2  that  the  exchange  is  'catalysed' 
both  by  hydroxyl  ions,  hydrogen  ions,  and  water  ([HgO]  is  put  equal  to  1). 

In  the  present  stage  of  our  investigation  we  shall  refrain  from  going  more 
fully  into  the  mechanism  of  the  exchange  reaction  proper,  but  a  few  words 
may  be  said  about  Eq.  (14)  which  ought  to  apply  generally  to  the  exchange 
of  deuterium  atoms  in  ND  groups  which  are  exposed  to  water  (k  =  1).  At 
the  rate  minimum,  pH'--'2-8,  Â:2[QH]  assumes  the  value  1-7  corresponding 
to  a  half-time  of  0-4  min.,  and  the  rate  is  therefore  so  high  that  a  quantita- 
tive measurement  by  our  method  is  impossible. 

Table  1 

DEUTERIUM  EXCHANGE  OF  GLYCYLGLYCYLGLYCINE  AT  0°C 

(«„  =  5) 


atpH3 

at  pH  5-6 

/ 

t 

(min.) 

n 

(min.) 

n 

0-8 

4-61 

2-0 

4-93 

2-3 

4-72 

4-4 

5-02 

4-0 

4-85 

6-3 

5-01 

7-0 

4-89 

48-3 

4-96 

27-3 

5-03 

57-3 

5-01 



— 

65-2 

4-96 

In  general  the  process  of  dissolution  of  the  lyophilized,  deuterated  pep- 
tide or  protein  takes  about  15-30  seconds,  and  the  first  15  /xl.  sample  for 
deuterium  oxide  estimation  is  taken  about  1-2  minutes  after  the  start  of 
the  kinetic  run.  Our  experiments  with  the  short  peptide  DL-leucyltriglycine^ 
could  therefore  only  demonstrate  that  all  six  of  the  exchangeable  hydrogen 
atoms  in  this  substance  exchanged  within  one  minute  at  0°.  The  pH  was, 
however,  in  the  neighbourhood  of  6  in  this  case,  and  in  view  of  the  strong 
dependence  upon  pH  found  for  the  exchange  of  PDLA,  a  reinvestigation 
of  the  exchange  of  short  peptides  at  pH  2-5-3'5  and  0°C  was  indicated. 
Such  a  reinvestigation  has  been  carried  out  by  Dr.  Wierzchowski  in  our 
laboratory  using  glycylglycylglycine  (GGG)  as  test  substance.  Table  1  shows 
the  results  of  two  typical  kinetic  runs  carried  out  in  essentially  the  same 
way  as  the  PDLA  experiments.  It  appears  from  this  table  that  the  exchange 
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reaction  is  faster  at  pH  5-6  than  at  pH  3,  but  that  even  in  the  latter  case 
about  85%  of  the  2  ND-groups  have  reacted  with  water  in  the  course  of 
2  minutes.  This  finding  is  compatible  with  a  half-time  of  0-5  min.  for  the 
reaction  of  these  groups  at  pH  3,  but  as  an  argument  in  favour  of  our  value 
for  ^2[QH]  it  is  not  very  strong  from  an  experimental  point  of  view,  and 
is  further  considerably  weakened  by  the  fact  that  about  15%  of  ND-groups 
apparently  exchange  at  a  lower  rate.  This  phenomenon  requires  further 
study  but  may  be  due  to  association  of  GGG  in  solution. 

If,  however,  the  value  530  cal.  is  adopted  for  AFr,  it  is  possible  to  cal- 
culate the  degree  of  unwinding  of  PDLA  in  aqueous  solution.  Some  of  the 
calculations  and  results  are  summarized  in  Table  2,  where  the  double  in- 
dices refer  to  the  two  ends  of  the  peptide  (cxy  is  the  concentration  of  a 
molecule  with  x  bonds  opened  at  one  end,  and  y  bonds  at  the  other). 

Table  2 
COMPOSITION    OF    PDLA-SOLUTION 

For  explanation  see  text 


Type  of  molecules 

Relative 
concentration,  % 

Number  of  hydrogen 
bonds  opened 

Concentration  of  different  types 
of  molecules 

0 
1 
2 
3 
4 
5 

Co  (intact  helix) 
Coi+Cio=2yco 
Cii+Co2+C2o=3y2co 

C22  +  Cl3  +  C31  +  Co4  +  C4o  =  5y4Co 
C23  +  C32  +  Cl4+C41  +  Co6  +  C50  =  6y5Co 

38-9 

29-3 

16-5 

8-3 

3-9 

1-8 

total     98-7 

Although  these  figures  should  be  accepted  with  proper  reservation  con- 
sidering their  highly  approximate  character,  they  suggest  an  interesting 
quahtative  picture  of  the  state  of  PDLA  in  aqueous  solution.  99%  of  the 
molecules  have  more  than  21  intact  hydrogen  bonds  out  of  26,  and  PDLA 
may  therefore  be  regarded  as  being  essentially  in  the  helical  form.  It  is  also 
inherent  in  the  considerations  leading  to  the  value  suggested  for  AFr  that 
the  rate  of  opening  and  closing  of  the  hydrogen  bonds  by  unwinding  the 
helix  is  very  high,  corresponding  to  a  low  free  energy  of  activation.  A  mole- 
cular structure  of  this  kind  may  be  said  to  possess  a  high  motihty  (a  term 
suggested  to  the  author  at  the  Gordon  Conference  1957)  and  any  qualita- 
tive distinction  between  a  native  and  a  denatured  state  in  aqueous  solution 
is  meaningless. 

In  a  previous  paper*  on  the  deuterium  exchange  of  insulin,  a  different 
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attitude  was  taken  to  the  question  of  the  relative  velocities  of  the  reactions 

A >-Ai,  and  Ai >>A2  (see  scheme  (6)).  It  was  assumed  that  Aj >'A2 

was  the  fast  reaction  so  that  Eq.  (7)  assumed  the  form 

ß=k,  (15) 

and  it  was  to  ki,  the  rate  constant  for  the  unfolding,  that  high  values  of  the 
energies  of  activation  zlH+  and  ZlF+  were  assigned.  The  truth  probably  lies 
somewhere  between  the  assumption  leading  to  Eq.  (8)  and  to  Eq.  (15).  In 
the  case  of  PDLA  the  maximum  free  energy  of  unfolding  the  hehx  is  that 
of  breaking  in  the  middle,  viz.,  3000-5000  cal.,  and  AF'^break,  the  free  energy 
of  activation  of  the  breaking,  must  be  at  least  that  high  and  probably  con- 
siderably higher.  From  the  first  term  on  the  right  side  of  Eq.  (14)  the  standard 
free  energy  of  activation  of  the  hydroxyl  ion  catalysed  exchange  may  be 
calculated  from  the  theory  of  absolute  reaction  rates.  The  value  obtained, 
viz.  JF!|lea;cA=4700,  indicates  that  somewhere  in  the  alkaline  region  there 
is  a  definite  chance  that  k^  x  [QH]^/:i.  In  the  case  of  insulin,  where  the 
breaking  of  the  helices  has  to  involve  much  greater  energies,  if  our  treat- 
ment used  for  PDLA  applies  here,  the  point  where  k2[QH]^ki  may  be 
reached  at  lower  pH.  In  agreement  with  this  general  picture  the  overall 
activation  energy  for  the  exchange  of  PDLA  is  found  to  be  considerably 
lower  than  that  found  for  insulin.  ^-^  The  portion  of  the  insulin  molecule^ 
in  which  the  ND-groups  have  half-times  of  exchange  of  24  hours  or  more 
at  0°  may  then  be  said  to  have  low  motility.  The  easy  way  of  exchange  by 
unwinding  the  helix  is  blocked  by  S-S  bonds  and  the  hard  one  by  breaking 
the  hehx  is  made  harder  by  stabilizing  interaction  between  sidechains.^-^^ 

Native  ^-lactoglobulin^*^  in  which  about  100  hydrogen  atoms  out  of  550 
do  not  exchange  at  all  during  several  days  at  38°C  at  pH  5-4,  represents 
a  class  of  proteins  with  very  low  motility,  i.e.  low  ki,  but  not  necessarily 
low  kilk-i.  Significant,  irreversible  denaturation  occurs,  if  at  higher  tem- 
peratures (50-60°C)  the  firmly  bound  hydrogen  atoms  are  forced  to  ex- 
change. Some  parts  of  the  ^-lactoglobulin  molecule  may  therefore  be  said 
to  be  kinetically  stable  but  thermodynamically  unstable  at  lower  tempera- 
ture, and  a  relatively  clear  distinction  between  native  and  denatured  forms 
may  be  made  here.  Unfortunately  the  interesting  denaturing  effect  of  high 
concentrations  of  urea  at  0°^^  could  not  be  studied  by  back-exchange  kine- 
tics, because  ^-lactoglobulin  could  not  be  fully  deuterated  without  damage 
to  the  protein.  Otherwise  experiments  of  this  kind  would  have  served  to 
throw  considerable  light  on  the  process  of  denaturation. 

It  remains  now  to  find  an  explanation  of  the  fact  that  PDLA  exchanges 
so  relatively  slowly  with  water  while  the  A-chain  of  insulin  and  even  oxi- 
dized ribonuclease  react  almost  instantaneously.  As  pointed  out  by  Schell- 
man^^  and  by  Harrington  and  Schellman,^^  the  tertiary  structure  plays  a 
very  important  role  in  the  stabilization  of  hehces,  which  when  'naked'  seem 
to  be  very  loose  with  a.  AFr  close  to  zero.  Since  PDLA  is  unassociated  in 
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aqueous  solution,  the  stabilizing  factor  must  be  an  intrinsic  one.  It  is  there- 
fore natural  to  think  of  the  methyl  side  groups  which,  if  there  is  a  random 
distribution  of  d  and  l  forms  along  the  chain,  will  protrude  in  two  different 
directions  from  the  core  of  the  helix,  and  so  that  a  quarter  of  them  may 
form  close  pairs  over  the  gap  between  two  turns  of  the  helix.  The  distance 
between  such  methyl  groups,  one  reaching  down  from  a  given  a-carbon 
of  the  hehx,  the  other  reaching  up  from  an  a-carbon  in  the  next  turn 
(separated  from  the  first  a-carbon  by  3  peptide  groups  and  2  a-carbons), 
is  3-6  A.  reckoned  from  center  to  center  of  the  /3-carbon  atoms.  Since  the 
average  distance  between  the  molecules  in  liquid  methane  at  —  164°C  is  of 
the  order  of  4  Â.,  there  is  reason  to  believe  that  a  hydrophobic  or  nonpolar 
bond  may  be  formed. 

If  we  assume  that  AFr  for  the  naked  helix  is  zero,  we  shall  have  to  adopt 
the  value  of  4x(-f530)^ — f2000  cal.  for  the  contribution  of  each  nonpolar 
bond  to  the  set  of  AFr  of  PDLA  (since  only  one-fourth  of  the  group  form 
pairs).  With  a  random  distribution  of  the  pairs,  it  may  be  permissible  as 
a  first  approximation  to  spht  the  contribution  in  four  parts  of  +530  assigned 
equally  to  four  hydrogen  bonds.  An  exact  treatment  would  involve  a  com- 
plete statistical  analysis  and  is  therefore  extremely  complex. 

It  is  difiicult  to  say  whether  the  value  +2000  cal.  is  a  reasonable  one  or 
not.  Due  to  the  small  additional  entropy  change  involved  in  the  formation 
of  the  nonpolar  bond  in  the  helix,  we  may  here  have  to  do  with  a  AH  value. 
Figures  of  the  order  1  to  2  kcal,  have  been  suggested  for  nonpolar  bond 
strengths  of  CH2  groups, ^^^ 2^'^*  but  it  seems  that  very  little  has  been  done 
to  determine  experimentally  the  thermodynamic  functions  for  the  forma- 
tion of  nonpolar  bonds  in  water.  We  shall  therefore  leave  the  question  open. 

The  assumption  concerning  the  internal  stabilization  of  PDLA  by  non- 
polar  bonds  is  supported  by  the  fact  that  poly-DL-serine  of  similar  chain 
lengths  (25  residues)  exchanges  'instantaneously'.  It  is  apparently  contra- 
dicted by  the  recent  findings^*-^^'^^  that  polypeptides  of  L-amino  acids  have 
a  tendency  to  form  righthanded  helices,  the  stability  of  which  decreases 
with  increasing  admixture  of  D-amino  acids.  Similarly  D-amino  acids  have 
a  preference  for  lefthanded  hehces.  However,  these  important  results  were 
obtained  with  poly  amino  acids  having  more  bulky  sidechains  than  PDLA, 
so  the  question  is  not  quite  settled.  It  is  clear,  however,  that  our  preparation 
must  contain  equal  quantities  of  righthanded  and  lefthanded  helices;  the 
question  is  only  whether  D-alanine  predominates  in  the  lefthanded  helices 
and  L-alanine  in  the  righthanded  ones,  in  which  case  the  frequency  of  methyl 
pairs  along  the  hehces  may  be  greatly  reduced.  This  question  may  be  solved 
by  digesting  our  sample  with  leucylaminopeptidase  or  carboxypeptidase, 
the  action  of  which  will  stop  whenever  a  D-alanyl  group  is  encountered. 
In  the  extreme  case,  in  which  the  preparation  consists  of  pure  poly-L-alanine 
(righthanded  helix)  and  pure  poly-D-alanine  (lefthanded  helix),  these  en- 
zymes should  spUt  half  of  the  peptide  bonds  (14-15  per  mole).  If,  however, 

Cps 
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the  D-  and  L-forms  are  randomly  distributed,  only  1/2  to  1  peptide  bond 
per  mole  will  be  opened.  Preliminary  experiments  have  given  results  which 
are  in  favor  of  a  random  distribution. 

The  author  is  indebted  to  J.  Schellman,  W.  Kauzmann,  and  M.  Ottesen  for  valuable 
discussions. 
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MACROMOLECULAR  CONFIGURATIONS 

Any  long  chain  molecule  can  acquire  a  very  large  number  of  possible  spatial 
configurations  as  a  result  of  rotation  about  single  bonds.  Schematic  dia- 
grams representing  some  of  the  possible  configurations  of  a  linear  polymer 
without  branching  points  or  cross-links  are  shown  in  Fig.  1 .  We  are  interested 
in  this  paper  in  asking  which  of  these  or  other  configurations  will  actually 
represent  a  given  macromolecule  in  solution.  The  answer  will  clearly  depend 
on  the  affinity  of  the  segments  of  the  macromolecule  for  one  another,  the 
affinity  of  solvent  molecules  for  one  another,  and  the  affinity  of  chain  seg- 
ments for  solvent  molecules.  It  is  profitable  to  define  three  extreme  situa- 
tions which  may  arise. 

(1)  Random  coiling. '^'^  Molecules  are  said  to  be  randomly  coiled  if  all  pos- 
sible configurations  have  the  same  energy.  In  that  event,  all  spatial  con- 
figurations, such  as  those  of  Fig.  1 ,  occur  with  equal  probabihty.  Randomly 


(a) 


(e) 

Fig.  1 .  Schematic  representation  of  possible  configurations  of  an  unbranched  polymer 
chain.  The  fully  extended  length  is  the  same  for  each  configuration. 
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coiled  macromolecules  are  ordinarily  described  in  terms  of  an  average 
configuration.  Fig.  \c  may  be  taken  as  a  schematic  representation  of  this 
average  configuration.  Exactly  random  coiling  can  occur  only  if  the  reac- 
tion, 2  segment-solvent  neighboring  pairs  ->  1  segment  —  segment  neighbor- 
ing pair  +  1  solvent  —  solvent  neighboring  pair,  has  exactly  zero  free  energy 
change,  a  situation  which  can  arise  only  fortuitously  ('ideal'  solvent*).  The 
class  of  randomly  coiled  polymers  in  solution  is  therefore  ordinarily  defined 
so  as  to  include  polymer-solvent  systems  in  which  there  are  slightly  more 
('good'  solvent)  or  slightly  fewer  ('poor'  solvent)  segment-solvent  contacts  than 
would  be  predicted  on  the  basis  of  statistics  alone.  The  corresponding  aver- 
age configurations  can  be  described  schematically  as  moving  from  Fig.  \c 
towards  Fig.  \d  or  Fig.  \b,  respectively.  Most  of  the  common  non-polar 
polymers,  dissolved  in  non-polar  solvents,  are  randomly  coiled  if  this  more 
flexible  definition  of  the  term  is  used. 

(2)  Helical  rods.  If  the  predominant  attractive  force  in  a  polymer-solvent 
system  is  a  specific  attraction  between  a  particular  atom  of  each  segment 
of  a  chain  molecule  for  another  particular  atom  of  another  segment,  then 
the  probable  result  is  a  helical  configuration. ^'-^^  The  resulting  overall  shape 
is  that  of  a  long,  thin  rod,  as  illustrated  by  Fig.  \e.  The  most  familiar  helical 
configuration  is  the  Pauling  a-helix.^^  This  configuration  describes  certain 
polypeptides  in  some  solvents,  e.g.  poly-y-benzyl  glutamate  in  dimethyl- 
formamide.^^ 

(3)  Compact,  rigid,  impenetrable  spheres.  If  the  only  important  force  in  a 
polymer-solvent  system  is  either  solvent-solvent  attraction  or  non-specific 
segment-segment  attraction  (or  both)  then  segment-solvent  contacts  will  be 
minimized,  a  result  best  achieved  by  insolubility  of  the  polymer  in  the  sol- 
vent. If,  however,  the  polymer  contains  more  than  one  kind  of  segment, 
and  if  some  of  the  segments  have  strong  affinity  for  solvent,  segment-segment 
or  solvent-solvent  attraction  being  otherwise  the  predominant  force,  then 
the  polymer  will  be  soluble  and  its  optimum  configuration  will  be  one  which 
minimizes  segment-solvent  contacts  except  for  those  segments  which  have 
strong  affinity  for  solvent.  An  ideaUzation  of  the  resulting  configuration  is 
a  compact,  rigid  sphere,  impermeable  to  solvent  (Fig.  \a).  The  segments 
attracted  to  solvent  are  on  the  outside.  The  remainder  fill  the  interior  space, 
which  is  devoid  of  solvent. 

The  three  extremes  just  discussed  were  chosen  because  they  may  be  un- 
equivocally distinguished  by  almost  any  physical  measurement.  For  instance, 
compact  rigid  spheres  have  much  smaller  radii  of  gyration  and  much  larger 

*  The  definition  here  given  in  terms  of  the  stability  of  individual  segment-solvent  pairs 
is,  of  course,  an  oversimplification.  Flory24  should  be  consulted  for  a  full  discussion  of 
this  subject. 
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sedimentation  and  diffusion  coefficients  than  molecules  of  the  same  mole- 
cular weight  which  are  random  coils  or  rigid  rods.  To  distinguish  between 
random  coils  and  rigid  rods  is  equally  easy  if  the  macromolecule  under  con- 
sideration can  be  prepared  with  a  range  of  molecular  weights.  The  radius 
of  gyration,  for  example,  would  then  be  proportional  to  the  molecular  weight 
for  rods  and  roughly  to  the  square  root  of  the  molecular  weight  for  random 
coils.  For  naturally-occurring  substances  of  fixed  molecular  weight  the  dis- 
tinction between  rods  and  random  coils  generally  requires  more  subtle  ana- 
lysis. The  present  paper  is  concerned  only  with  macromolecules  bearing 
electric  charges.  It  will  appear  subsequently  that  such  charges  have  a  con- 
siderable effect  on  the  configuration  of  randomly-coiled  molecules,  enabling 
us  to  distinguish  them  readily  from  rigid  rods. 

The  simplest  physical  measurement  which  will  distinguish  between  the 
extreme  configurations  here  under  discussion  is  the  intrinsic  viscosity,  [r]]. 
For  rigid  spheres  of  unit  density  [17] =2-5  cc./gram,  independent  of  mole- 
cular weight.  For  rigid  rods  or  random  coils,  [q]  is  much  larger:  it  varies 
roughly  as  M^  for  the  former  and  as  M°-^'^°  for  the  latter  (where  M  = 
mol.  wt.). 

It  would  be  naive,  of  course,  to  expect  all  macromolecules  to  fall  into 
one  of  the  extreme  configuration  classes  just  discussed.  Intermediate  con- 
figurations which  undoubtedly  exist  are  helical  rods  with  a  small  number 
of  breaks,  random  coils  with  some  cross-links,  compact  spherical  regions 
joined  by  short  lengths  of  randomly-coiled  chain,  etc.  To  distinguish  these 
from  one  another  in  an  unequivocal  way  is  at  present  not  possible. 

ISOELECTRIC    GLOBULAR    PROTEINS 
IN    AQUEOUS    SOLUTION 

It  is  clear  from  the  preceding  discussion  that  the  configuration  of  macro- 
molecules in  solution  is  determined  by  the  properties  of  the  solvent  as  much 
as  by  those  of  the  macromolecules.  Thus,  the  configuration  of  a  protein 
molecule  is  likely  to  differ  from  one  solvent  to  another,  and  in  any  solvent 
may  differ  from  the  configuration  in  the  crystalHne  state.  This  paper  will 
concern  itself  with  the  configuration  of  common  globular  proteins  in  dilute 
aqueous  solution.  The  conclusions  reached  will  not  apply  to  crystalline  pro- 
teins or  to  proteins  in  other  solvents.  Indeed,  it  is  well-established  that  con- 
figurations of  proteins  in  solvents  such  as  concentrated  aqueous  urea  differ 
markedly  from  what  they  are  in  water. 

When  many  of  the  common  globular  proteins,  in  aqueous  solution,  are 
examined  at  or  near  their  isoelectric  points,  they  can  usually  at  once  be 
assigned  to  a  configuration  much  closer  to  that  of  a  compact,  rigid,  im- 
penetrable sphere  than  to  the  other  extremes  mentioned  earher.  This  is 
illustrated,  for  example,  by  the  intrinsic  viscosities  listed  in  Table  1.  A 
similar  conclusion  is  reached  from  all  other  pertinent  measurements. 

It  should  be  noted  that  three  of  the  proteins  listed  in  Table  1,  fibrinogen. 
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collagen  and  myosin,  obviously  do  not  resemble  compact  spheres  in  aqueous 
solution.  The  configuration  of  these  proteins  thus  presents  a  problem  dif- 
ferent from  that  of  the  other  proteins  listed.  We  shall  not  discuss  these 
proteins  in  this  paper.  It  is  worth  noting,  however,  that  their  different  con- 
figuration is  not  primarily  a  result  of  their  high  molecular  weight.  Other 
proteins  of  high  molecular  weight,  such  as  fumarase^^  and  catalase,^^-^^ 
appear  from  sedimentation  and  diffusion  studies  to  have  a  compact  con- 
figuration near  their  isoelectric  points. 

Table  1 

INTRINSIC  VISCOSITIES  OF  ISOELECTRIC  GLOBULAR  PROTEINS 
IN  AQUEOUS  SOLUTION 


Rigid  impenetrable  sphere« 

Mol.  Wt. 

[t?],  cc./g. 

any 

1-9 

Random  coil^ 

50,000 

47 

Random  coil* 

560,000 

91 

Helical  rod'^ 

66,000 

45 

Helical  rod^ 

340,000 

262 

Ribonucleaseio 

13,683 

3-3 

Myoglobin^io 

17,000 

3-1 

^-Lactoglobulin^ 

35,000 

3-4 

Pepsin^i 

35,000 

3-4 

Ovalbumin^^ 

44,000 

4-3 

Serum  Albumin^^ 

65,000 

3-7 

Hemoglobin".  89 

67,000 

3-6 

Conalbumin^^ 

77,000 

3-8 

Fibrinogen35.7û 

330,000 

27 

Collagen^ 

345,000 

1150 

Myosin^ß 

550,000 

230 

{a)  Unsolvated  sphere  of  density  1  -33,  by  Einstein's  equation.22 

{b)  Polyisobutylene  in  cyclohexane.^^ 

(c)  Poly-y-benzyl  glutamate  in  dimethyl  formamide.^^ 

Two  questions  arise  in  connection  with  those  proteins  which  do  possess 
a  compact,  close  to  spherical  shape.  The  first  (and  the  less  interesting)  con- 
cerns the  meaning  of  the  relatively  small  deviations  of  the  observed  viscosity 
(or  of  other  pertinent  properties)  from  the  theoretical  values  for  perfect 
rigid,  solvent-free  spheres.  Part  of  the  deviation  undoubtedly  results  from 
incorporation  of  some  solvent  in  the  dissolved  particle.*-^''*  Deviations  from 
spherical  shape^  and  non-rigidity  presumably  account  for  the  remainder. 
Attempts  to  obtain  quantitatively  the  contribution  of  each  of  these  factors 

*  It  is  important  to  note,  however,  that  the  water  so  incorporated  must  be  largely  bound 
or  trapped  water.  That  the  protein  particle  is  not  permeable  to  the  solvent  as  a  whole  is 
shown  unequivocally  by  the  effect  of  ionic  strength  on  titration  curves. s 7  For  compact 
proteins  this  effect  is  much  smaller  than  it  would  be  if  salt  ions  could  enter  the  space 
between  charges,  where  they  would  most  effectively  shield  interaction  between  charges. 
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have  had  poor  success. ^^  (See,  for  instance,  the  criticisms^- ■^^•^^•^"*  of  a 
recent  attempt  by  Scheraga  and  Mandelkern.'^) 

More  interesting  than  the  small  deviations  from  the  ideal  spherical  model 
is  the  question  why  the  behavior  of  these  proteins  is  as  close  as  it  is  to  the 
behavior  of  rigid  spheres.  In  other  words,  what  are  the  forces  which  main- 
tain so  compact  a  structure?  As  was  pointed  out  earlier,  these  can  be  attrac- 
tive forces  between  solvent  molecules  or  attractive  forces  between  segments 
of  the  polypeptide  chain,  or  both.  In  addition,  the  fact  that  aqueous  solu- 
tions of  these  proteins  are  stable  at  all  requires  the  existence,  at  the  surface 
of  the  molecular  particle,  of  a  few  segments  of  the  polypeptide  chain  which 
have  strong  affinity  for  solvent.  The  following  pages  will  list  in  turn  each 
of  the  forces  which  may  make  a  contribution,  and  will  try,  on  the  basis  of 
general  chemical  principles,  to  assess  the  relative  importance  which  each 
might  have.  Most  of  the  points  which  will  be  brought  out  have  been  made 
before,  notably  in  a  recent  review  by  Kauzmann, ^^  They  bear  repeating, 
however,  as  a  deterrent  against  the  dangerous  (though  popular)  practice  of 
ascribing  the  stability  of  compact  proteins  to  one  force  alone. 

Attraction  between  solvent  molecules.  One  of  the  relevant  forces  which  with- 
out doubt  contributes  to  the  stability  of  compact  configurations  is  the  hydro- 
gen bonding  between  water  molecules.  This  force  repels  non-polar  groupings 
since  these  would  create  holes  in  the  hydrogen  bonded  structure  of  the 
solvent.  In  aqueous  detergent  solutions  this  force  acts  intermolecularly,  caus- 
ing the  non-polar  portions  of  detergent  molecules  to  aggregate,  with  the 
formation  of  micelles.  In  proteins  there  are  many  non-polar  side  chains  on 
a  single  molecule.  The  same  force  is  therefore  expected  to  act  intramolecularly 
and  to  lead  to  a  compact  structure  with  the  non-polar  side  chains  in  the 
center.  Kauzmann**  has  used  the  term  'hydrophobic'  bonding  to  describe 
the  intramolecular  attraction  which  results  from  this  force,  and  we  shall 
employ  the  same  term. 

Electrostatic  forces  between  charged  segments  of  the  protein  chain.  Isoelectric 
proteins  have  a  net  charge  of  zero.  The  molecules  possess,  however,  a  large 
number  of  charges,  both  positive  and  negative,  Coulombic  forces  between 
these  charges  must  exist,  and  they  provide  a  second  factor  which  inevitably 
affects  the  stability  of  a  compact  configuration.  The  contribution  of  the 
interaction  between  these  charges  to  the  free  energy  of  a  compact  protein 
molecule  can  be  calculated  by  the  method  of  Kirkwood,*'  which  takes  into 
account  the  fact  that  the  dielectric  constant  within  the  molecule  is  different 
from  that  of  the  solvent,  Kirkwood's  method  has  recently  been  adapted  by 
the  author^^-^"  to  estimation  of  the  electrostatic  interaction  energy  of  spheri- 
cal protein  molecules  with  specified  locations  of  charges.  In  this  method, 
the  molecule  is  treated  as  being  impermeable  to  solvent  and  the  charges 
are  taken  to  be  at  the  surface,  in  contact  with  solvent,  such  that  the  distance 
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of  closest  approach  between  a  charge  and  a  water  molecule  is  the  same  as 
it  would  be  in  a  low  molecular  weight  organic  ion  in  aqueous  solution. 

Fig.  2  shows  three  model  structures  for  which  calculations  have  been  made 
for  this  paper.  All  calculations  were  averaged  over  all  configurations  result- 
ing from  permutation  of  charges  over  identical  sites.  They  are  approximate 
in  that  the  Bragg- Williams  method^"^  was  used.  (This  is  equivalent  to  reten- 
tion of  only  one  term  in  equation  33  of  Tanford  and  Kirkwood.^'^)  For 
each  model,  two  isoelectric  forms  are  likely  to  occur,  one  with  protons 
removed  from  every  carboxyl  group  and  from  no  other  groups,  the  second 
with  protons  removed  from  all  but  one  of  the  carboxyl  groups  and  from 
one  of  the  imidazolium  groups.  The  corresponding  electrostatic  interaction 


-6400 
-5200 


MODEL  B 
ELECTROSTATIC  INTERACTION    ENERGIES  (CALS./MOLE.) - 

-10800  +6600 

-10000  +13200 


Fig.  2.  Models  used  for  calculation  of  the  electrostatic  contribution  to  the  free  energy 
of  compact  configurations.  The  letters  C,  G,  I,  N  and  O  represent,  respectively,  carboxyl, 
guanidine,  imidazole,  amino  and  phenolic  groups.  The  dodecahedra  are  inscribed  in 
spheres  of  radius  10  Â,  with  the  vertices  1  Â  below  the  surface,  this  being  the  distance 
of  closest  approach  of  solvent  to  the  charged  sites  on  any  organic  acid  or  base.^°  The 
distance  between  nearest  neighboring  vertices  in  6-5  Â. 


energies  are:  for  model  A,  —6400  and  —5200  calories/mole;  for  model  B, 
-10,800  and  -10,000  calories/mole;  for  model  C,  +13,200  and  +6600 
calories/mole.  These  calculations  are  for  isolated  protein  molecules  at  zero 
ionic  strength.  The  addition  of  inorganic  salts  reduces  these  values,  but  for 
impermeable  molecules,  the  effect  is  relatively  small,  as  Table  2  shows, 
because  the  salt  ions  cannot  enter  the  space  between  charges,  where  they 
would  be  most  effective. 

These  calculations  clearly  show  that  the  contribution  of  electrostatic  inter- 
action to  the  free  energy  of  isoelectric  protein  molecules  can  be  very  large, 
and  that  the  magnitude  of  the  effect  depends  markedly  on  the  positions  of 
charged  sites  relative  to  one  another.  The  interaction  energy  may  be  nega- 
tive or  positive.  However,  models  leading  to  positive  interaction  energies 
(such  as  model  C)  require  unlikely  arrangement  of  the  charges  with  most 
of  the  positive  charges  well  separated  from  most  of  the  negative  charges. 
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The  fact  that  most  protein  molecules  in  solution  have  very  low  permanent 
dipole  moments,^^  strongly  suggests  that  positive  and  negative  charges  are 
in  fact  more  or  less  evenly  distributed.  Such  an  arrangement  always  leads 
to  a  net  negative  electrostatic  interaction  energy  for  the  isoelectric  state, 
with  the  result  that  electrostatic  forces  contribute  materially  to  the  stability 
of  compact  configurations.* 

Further  calculations  for  these  same  models  will  be  reported  in  Table  2. 
It  should  be  noted  that  the  isoelectric  point  is  not  necessarily  the  point  at 
which  the  electrostatic  interaction  energy  is  a  minimum.  For  model  C,  for 
instance,  the  minimum  occurs  when  the  net  charge  is  —4. 

Intramolecular  hydrogen  bonds.  A  third  force  is  provided  by  intramolecular 
hydrogen  bonds. ^^-^^'^^-^^  It  is  convenient  to  distinguish  between  two  kinds 
of  intramolecular  hydrogen  bonds.  The  first  is  the  familiar  peptide  hydrogen 
bond  (  — C  =  0. .  .HN=).^'"^^  The  second  kind  is  the  hydrogen  bond  be- 
tween side-chain  groups. ^^  Examples  of  the  latter  are  the  bond  between 
undissociated  carboxyl  groups,  well  known  in  the  dimeric  form  of  carboxylic 
acids  in  some  nonaqueous  solvents,  and  the  frequently  postulated  phenol- 
carboxylate  bond.  Such  hydrogen  bonds  are  probably  the  principal  factor 
determining  the  configuration  of  proteins  in  the  solid  state. ^^  In  aqueous 
solutions,  however,  there  is  a  competition  between  formation  of  such  intra- 
molecular bonds  and  formation  of  hydrogen  bonds  between  the  same  groups 
and  water  molecules,  so  that  their  effect  on  configuration  is  likely  to  be 
reduced.  When  appropriate  small  molecules  are  studied,  to  which  the  same 
kind  of  competition  applies,  equilibrium  is  found  to  lie  on  the  side  of  hydro- 
gen bonds  to  water.  Thus  Schellman''"  has  calculated,  from  the  thermo- 
dynamic behavior  of  urea  in  water  solution,  that  in  such  solutions  the 
association  of  two  urea  molecules  by  hydrogen  bonds  of  the  peptide  type 
is  accompanied  by  3.  positive  free  energy  change  (zlF°  =  +  1990  calories/mole). 
Similarly,  for  the  association  of  tyrosine  with  acetate^"''  by  formation  of 
phenol-carboxylate  hydrogen  bonds,  AV°>  1400  calories/mole. 

Of  course,  these  model  reactions  are  between  small  molecules  which  are 
free  in  solution,  and  are  therefore  accompanied  by  a  loss  of  translational 
entropy.  The  formation  of  the  corresponding  bonds  between  segments  of 
a  polypeptide  chain  involves  in  place  of  this  factor  a  loss  of  rotational 
entropy.  The  difiference  between  these  quantities  could  well  be  sufficiently 
large  to  change  the  sign  of  the  free  energy  of  formation  of  such  hydrogen 
bonds  in  aqueous  protein  solutions.  In  fact,  Schellman'^  made  a  calcula- 
tion to  show  that  this  is  just  what  would  happen  in  the  case  of  peptide 

*  The  contribution  of  electrostatic  forces  to  the  stability  of  a  compact  configuration 
is,  of  course,  the  difference  between  the  figures  calculated  and  the  corresponding  inter- 
action energies  in  a  randomly  coiled  configuration.  The  latter,  however,  are  comparatively 
small  because  the  space  between  charges  in  solvent-permeated  configuration  is  filled  by 
material  of  high  dielectric  constant. 
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hydrogen  bonds,  but  in  a  later  paper,^^  this  prediction  was  withdrawn.  For 
formation  of  phenol-carboxylate  hydrogen  bonds  in  proteins,  Laskowski 
and  Scheraga^^  h^ve  calculated  a  large  loss  of  rotational  entropy  which  is 
of  the  same  order  of  magnitude  as  the  probable  loss  of  translational  entropy 
in  the  tyrosine-acetate  reaction  mentioned  above. 

Perhaps  even  more  important  is  the  fact  that  hydrogen  bonds  in  proteins 
can  occur  in  conjunction  with  other  intramolecular  forces,  such  as  hydro- 
phobic forces.  For  instance,  when  several  amino  acid  residues,  adjacent  in 
sequence,  are  all  non-polar,  then  they  will  all  tend  to  be  located  in  the 
interior  of  the  molecule  and  will  carry  with  them  the  part  of  the  peptide 
chain  to  which  they  are  attached.  This  part  of  the  polypeptide  chain,  being 
removed  from  contact  with  water,  will  presumably  tend  to  form  peptide 
hydrogen  bonds,  and  the  ability  to  form  such  bonds  will  contribute  to  the 
stability  of  the  hydrophobic  bonding.* 

That  hydrogen  bonds  are  in  fact  present  in  the  compact  forms  of  globular 
proteins  is  strongly  suggested  by  several  kinds  of  experiments.  The  deter- 
mination of  optical  rotatory  power,  for  example  by  Yang  and  Doty,^^^ 
indicates  that  globular  proteins  in  aqueous  solutions  are  partially  in  the 
form  of  helices.  This  almost  certainly  reflects  the  presence  of  peptide  hydro- 
gen bonds.  The  deuterium-hydrogen  exchange  experiments  of  Linderstrom- 
Lang  and  Hvidt^^-^^  show  that  some  exchangeable  hydrogen  atoms  are 
more  slowly  exchanged  than  others.  This  result  could  arise  if  peptide  or 
amide  groups  (for  example)  were  in  the  hydrophobic  interior  of  the  molecule 
without  hydrogen  bonding,  but  it  is  usually  assumed  that  if  such  groups  are 
so  located  they  will  also  tend  to  be  hydrogen-bonded.  Finally,  it  will  be  seen 
below  that  in  at  least  two  proteins  the  stability  of  the  compact  form  towards 
pH  suggests  the  presence  of  hydrogen  bonds  or  intramolecular  ion  pairs 
between  titratable  side-chain  groups.  It  is  of  interest  in  this  connection  that 
the  titration  curves  of  some  simple  dicarboxylic  acids ^"^  with  bulky  non- 
polar  side  chains  indicate  the  presence  of  weak  intramolecular  bonds  between 
— COOH  and  —COO". 

Intramolecular  ion  pairs.]  The  attraction  between  positive  and  negative  ions 
leads  to  close  association  between  them  in  the  crystalhne  state  and  to  the 
formation  of  ion  pairs  in  non-aqueous  solvents.  In  water,  however,  the  free 
energy  of  hydration  is  sufficiently  negative  to  prevent  ion  pair  formation, 
at  least  between  univalent  ions.  Even  guanidinium  and  acetate  ions,  the 
association  of  which  would  be  favored  by  hydrogen  bonds  as  well  as  electro- 

*  Hydrogen  bonds  can  be  important  even  if  they  do  not  contribute  to  thermodynamic 
stability  by  blocking  kinetic  pathways  to  more  extended  configurations.  For  example, 
the  activated  state  for  unfolding  could  be  one  in  which  intramolecular  hydrogen  bonds 
are  ruptured,  but  in  which  new  hydrogen  bonds  to  water  are  not  yet  formed. 

f  This  new  term  is  suggested  as  more  descriptive  than  the  term  'salt  bridge'  which  has 
sometimes  been  used  in  the  past  to  describe  this  kind  of  bond. 
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Static  attraction,  do  not  associate  appreciably.^^  However,  as  in  the  case 
of  hydrogen  bonds,  there  will  be  a  difference  in  entropy  between  the  forma- 
tion of  intermolecular  ion  pairs  between  small  ions  and  the  formation  of 
corresponding  bonds  between  protein  side  chains  of  opposite  charge.  There 
is  no  experimental  evidence  at  present  which  would  enable  one  to  arrive 
at  a  decision  concerning  the  presence  of  such  intramolecular  ion  pairs  in 
compact  protein  molecules  or  on  their  contribution  to  the  maintenance  of 
a  compact  structure.  (In  calculating  the  energy  of  electrostatic  interaction 
earlier  in  this  paper,  the  absence  of  ion  pair  formation  was  assumed  :  the 
closest  distance  of  approach  of  two  charges  in  the  models  used  is  6-5  Â.) 

Hydrophilic  groups  at  the  surface.  We  have  discussed  the  attractive  forces 
between  solvent  molecules  and  the  attractive  forces  between  segments  of 
the  polypeptide  chain,  which,  acting  alone  or  together,  could  stabihze  a  com- 
pact configuration  for  protein  molecules  in  aqueous  solution.  As  was  pointed 
out  earHer,  an  additional  requirement  for  the  stabihty  of  such  a  configura- 
tion is  the  presence  of  a  small  number  of  surface  sites  with  strong  afiinity 
for  the  solvent.  For  proteins  in  aqueous  solution,  the  side  chain  groups 
which  bear  charges  would  seem  most  likely  to  fill  this  requirement.  Indeed, 
there  is  strong  evidence  from  titration  curves  that  most  of  these  groups  are 
in  fact  in  contact  with  solvent  in  every  globular  protein.  Ribonuclease,  for 
example,  maintains  its  compact  configuration  through  most  of  the  accessible 
pH  range  (see  below).  The  titration  curve^^  shows  that  every  one  of  the 
carboxyl,  imidazole  and  amino  groups  (but  not  all  phenolic  groups:  see 
below)  which  this  protein  is  known  to  possess  is  titrated  reversibly  in  this 
pH  range.  Moreover,  there  is  no  conflict  between  the  observed  titration 
curve  and  a  calculated  curve  based  on  {a)  intrinsic  dissociation  constants 
identical  with  these  found  in  appropriate  substances  of  low  molecular  weight, 
{b)  electrostatic  interaction  calculated  with  the  assumption  that  charges  are 
as  close  to  the  surface  as  in  low  molecular  weight  organic  acids.  Since  any 
variation  in  the  closeness  of  charged  sites  to  the  solvent  results  in  large 
changes  in  electrostatic  self-energy,^'  the  presence  of  as  few  as  one  or  two 
of  these  titratable  groups  in  the  interior  of  the  molecule  would  at  once  be 
detected,  either  by  failure  to  titrate  these  groups  at  all  or  at  least  by  pro- 
nounced shifts  in  their  dissociation  constants. 

Ovalbumin  is  another  protein  which  maintains  a  compact  configuration 
over  most  of  the  accessible  pH  range.  The  titration  curve^^  again  requires 
that  essentially  all  charged  sites  be  at  the  surface.  The  same  conclusion 
apphes  with  somewhat  less  force  to  most  other  proteins.  Many  of  these  do 
not  maintain  their  compact  configuration  over  a  sufficient  range  of  pH  to 
allow  observation  of  the  actual  titration  of  the  majority  of  charged  sites  in 
the  absence  of  configurational  change.  One  can  often,  however,  deduce  the 
presence  of  titratable  groups  with  normal  properties  even  if  they  are  not 
titrated  directly.  It  is  unhkely  that  any  of  the  small  globular  proteins  has 
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as  many  as  ten  per  cent  of  its  carboxyl,  amino,  imidazole  or  guanidine  groups 
anywhere  other  than  in  direct  contact  with  solvent. 

Hemoglobin  has  been  cited  as  a  possible  exception  to  this  generaliza- 
tion.'^'^^ This  protein  undergoes  a  reaction  below  pH  4  which  is  accom- 
panied by  the  sudden  binding  of  18  protons  per  molecule  (molecular  weight 
33,500).  The  protein  acts  as  if  18  carboxyl  groups  in  their  anionic  form  were 
not  available  to  titration  above  pH  4.  This  conclusion  is  erroneous,  however. 
The  reaction  of  hemoglobin  below  pH  4  is  accompanied  by  breakdown 
of  the  compact  structure  (see  below).  As  a  result,  there  is  a  decrease  in 
electrostatic  interactions  and  this  necessarily  results  in  the  sudden  binding 
of  about  15  protons^^  to  sites  which  were  available  all  the  time  and  were 
not  occupied  by  protons  solely  because  of  the  repulsive  force  of  other  posi- 
tively charged  sites.  Thus,  if  any  of  the  carboxylate  side  chains  of  hemo- 
globin are  not  available  to  titration,  the  number  must  be  quite  small.  It  is 
probable  that  the  explanation  here  given  accounts  for  the  apparent  unavail- 
ability of  prototropic  sites  in  some  other  proteins.^ ^ 

Of  the  side  chain  groups  subject  to  titration  with  acid  or  base,  there  is 
one  kind  which  has  been  found  'unavailable',  i.e.  presumably  in  the  interior, 
in  some  proteins.  These  groups  are  the  phenohc  groups.  In  ovalbumin^^ 
all  of  these  groups  are  unavailable  as  long  as  the  compact  configuration 
is  maintained;  in  ribonuclease^^  three  out  of  six  phenolic  groups  are  un- 
available; in  conalbumin,^"^  too,  about  half  are  unavailable.  Phenolic  groups, 
however,  are  not  charged  at  the  pH  at  which  these  proteins  exist  in  nature. 
Uncharged  phenolic  groups  have  no  particular  affinity  for  water  and  to  find 
some  of  them  in  the  'hydrophobic'  interior  of  protein  molecules  is  therefore 
no  contradiction  of  the  generalization  made  in  this  section. 

Disulfide  bonds.  The  discussion  so  far  has  made  only  incidental  reference 
to  the  effect  of  cross-linkages  on  the  configuration  of  polymeric  chain  mole- 
cules. Most  proteins  possess  such  links  in  the  form  of  disulfide  bonds.  Their 
principal  effect  is  to  limit  the  range  of  possible  extended  configurations: 
configurations  such  as  illustrated  by  Fig.  \d  may  become  unattainable  as 
long  as  disulfide  bonds  are  intact.  This  factor  will  indirectly  increase  the 
stability  of  a  compact  configuration,  as  compared  to  that  of  a  randomly- 
coiled  configuration.  The  chief  factor  favoring  the  latter  is  its  high  con- 
figurational  entropy.  The  presence  of  cross-links,  however,  decreases  the 
number  of  possible  configurations  available  to  the  randomly  coiled  state,  and 
hence  reduces  the  entropy  gain  which  would  result  from  a  transition  to  this  state. 
There  is  another  possible  effect  of  the  presence  of  disulfide  cross-links 
which  works  in  the  opposite  direction.  To  gain  maximum  benefit  from  'hydro- 
phobic' bonds,  electrostatic  interactions,  etc.,  requires  freedom  to  bring 
appropriate  portions  of  a  polypeptide  chain  into  each  other's  vicinity.  Di- 
sulfide bonds  interfere  with  this  freedom  so  that  their  presence  may  make 
a  compact  structure  less  stable  than  it  otherwise  would  have  been. 
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There  is  no  way  of  predicting  which  of  these  effects  will  predominate. 
If  ribonuclease  is  oxidized  with  performic  acid  so  that  all  disulfide  bonds 
are  broken,  it  loses  its  compact  configuration  even  at  the  isoelectric  point. ^^ 
On  the  other  hand,  ovalbumin,  which  contains  just  a  single  disulfide  bond 
in  its  natural  state  is  one  of  the  proteins  most  resistant  to  loss  of  its  com- 
pact form,  whether  by  acid,  base  or  urea.  Finally,  serum  albumin,  with 
about  16  disulfide  bonds,  loses  its  compact  configurational  more  readily 
than  most  proteins.  Another  aspect  of  the  presence  of  disulfide  bonds  is 
that  they  are  hkely  to  facilitate  the  reversibility  of  configurational  changes, 
as  Kauzmann**  has  pointed  out. 

It  is  evident  from  the  preceding  discussion  that  no  basic  principles  of 
chemistry  and  no  conclusive  experimental  tests  allow  us  to  state  precisely 
what  forces  maintain  a  compact  configuration  for  so  many  isoelectric  pro- 
teins in  aqueous  solution.  Much  experimental  work  is  currently  going  on 
in  many  laboratories  towards  the  elucidation  of  this  problem.  The  most 
fruitful  approach  has  been  to  study  the  conditions  under  which  compact 
configurations  break  down.  Such  breakdown  occurs  quite  generally  when 
the  solvent  is  changed,  when  an  aqueous  solution  is  heated,  when  acid  or 
base  is  added,  etc.  It  is  usually  called  denaturation,  although  some  definitions 
of  this  term  would  not  include  rapidly  reversible  changes  in  configuration. 
A  recent  general  review  by  Putnam^*  and  the  paper  by  Kauzmann**  earlier 
referred  to  provide  a  summary  of  results  which  have  been  obtained.  Unfor- 
tunately, these  results  have  not  as  yet  progressed  to  where  they  can  provide 
an  answer  to  the  questions  which  we  have  posed.  In  general,  kinetic  studies 
of  denaturation  predominate.  There  have  been  few  attempts,  however,  to 
characterize  the  configuration  of  the  denatured  form  or  to  determine  the 
thermodynamics  of  the  denaturation  process.*  The  studies  have  gone  far 
enough,  however,  to  indicate  that  every  protein  behaves  in  an  individual 
way  towards  various  agents  which  cause  configurational  change.  As  a  result, 
it  may  well  be  that  different  proteins  possess  a  compact  configuration  for 
quite  different  reasons. 

The  remainder  of  this  paper  will  summarize  recent  information  obtained 
by  the  present  author  and  co-workers,  and  by  others,  on  configurational 
changes  which  accompany  the  titration  of  proteins  in  aqueous  solution.  The 
results  will  serve  to  illustrate  the  conclusion  that  we  are  still  far  from  a 
solution  of  the  problems  which  we  set  out  to  solve. 

THE    EFFECT    OF    pH    ON    CONFIGURATION 

When  acid  or  base  is  added  to  an  isoelectric  protein  solution,  three  things 

*  The  lack  of  thermodynamic  data  results  in  large  part  from  the  fact  that  denaturation 
has  usually  been  studied  under  irreversible  conditions.  An  unfortunate  consequence  is 
that  we  do  not  know  for  most  proteins  whether  the  observed  range  of  stability  of  a  com- 
pact configuration  reflects  conditions  under  which  this  configuration  is  truly  the  form  of 
lowest  free  energy,  or  whether  maintenance  of  compactness  is  merely  a  kinetic  phenomenon. 
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happen:  (1)  the  pH  of  the  solution  is  changed,  (2)  protons  are  removed  from 
or  added  to  appropriate  acidic  or  basic  groups,  (3)  the  net  charge  of  the  pro- 
tein molecule  is  increased.  Of  these  changes,  the  last-named  is  the  one  most 
likely  to  affect  the  configuration,  as  was  pointed  out  as  early  as  1931  by 
Wu.^°^  For  the  increase  in  net  charge  necessarily  results  in  repulsion  be- 
tween charges  and  a  net  positive  contribution  of  electrostatic  interaction 
to  the  free  energy  of  a  compact  configuration.  Some  calculations  for  the 
models  of  Fig.  2  are  shown  in  Table  2.  They  were  made  using  the  same 
equations  and  assumptions  discussed  earlier.  Table  2  shows  changes  in 
electrostatic  interaction  energy  between  10,000  and  20,000  calories  for  the 
models  used,  which  contain  12  or  20  charged  sites.  The  corresponding 
changes  for  real  proteins  can  easily  amount  to  100,000  calories/mole. 

Table  2 

THE   CONTRIBUTION   OF  ELECTROSTATIC   INTERACTION  TO 
THE    FREE    ENERGY   OF   COMPACT    SPHERICAL    PROTEINS 


Interaction  Energy,  cal. /mole* 

Net 

Model  A 

Model  B 

Model  C 

Charge 

1^=0 

/x=0015 

/i=0060 

M=0 

,^=0 

+9 

+25340 

+  6 

+  7530 

+  5880 

+  5110 

+  8510 

+  14100 

+  5 

+  11950 

+  4 

+  380 

-  30 

-  170 

+  250 

+  10570 

+  3 

-2260 

-2220 

-2130 

-  3190 

+  10030 

+2 

-4260 

-3880 

-3610 

-  6170 

+  10270 

+  1 

-5640 

-5030 

-4640 

-  8700 

+  11320 
+  5210 

0 

-6390 

-5650 

-5200 

-10770 

+  6610 

-5240 

-4630 

-4260 

-10030 

+  13150 

-1 

-5440 

-4820 

-4440 

-11630 

+  8800 
+  3660 

-2 

-4060 

-3670 

-3400 

-11110 

+  6200 
+  5970 

-3 

-2270 

-2220 

-2120 

-  9210 

+  3920 

-1210 

-1170 

-1100 

-  6570 

+  8510 

-4 

+  1100 

+  700 

+  550 

-  4350 

+  1960 
+  7060 

-5 

+  5910 
+  10610 

-6 

+9410 

+  7750 

+  6930 

+  6860 

+  15240 

-9 

+25930 

*  The  model  structures  to  which  these  calculations  apply  are  shown  in  Fig.  2.  Where 
two  values  are  shown  for  the  same  net  charge  they  refer  to  alternate  forms  which  are 
likely  to  be  present  with  that  net  charge.  For  model  C,  for  example,  with  a  charge  of 
—4,  the  first  figure  is  for  the  form  with  protons  removed  from  two  phenoHc  groups,  the 
second  for  the  form  with  protons  removed  from  one  phenoUc  and  one  amino  group. 
11  stands  for  ionic  strength. 
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The  effect  of  ionic  strength  on  interaction  energies,  for  impenetrable 
models  such  as  these,  is  quite  small.  This  is  shown  by  the  calculation  of  this 
effect  for  model  A.  It  should  be  noted,  incidentally,  that  the  choice  of  im- 
penetrable models  is  not  an  arbitrary  one.  As  already  mentioned,  it  is  forced 
on  us  by  the  small  experimentally  observed  effect  of  ionic  strength  on  titra- 
tion curves  of  compact  proteins.^' 

Table  2  indicates  that,  whatever  the  forces  stabilizing  a  compact  con- 
figuration near  the  isoelectric  point,  electrostatic  interaction  is  the  principal 
force  which  makes  such  a  configuration  unstable  as  one  moves  away  from 
the  isoelectric  region.  For  the  strength  of  'hydrophobic'  bonding,  of  peptide 
hydrogen  bonds  and  of  disulfide  bonds  is  not  affected  at  all  by  pH  changes. 
Hydrogen  bonds  between  side  chain  groups  and  intramolecular  ion  pairs 
are  affected,  since  titration  of  the  side  chain  groups  involved  in  such  bonds 
would  lead  to  their  weakening  or  rupture.  However,  all  the  energy  contri- 
buted in  this  way  to  a  shift  in  the  free  energy  balance  between  a  compact 
and  an  extended  configuration  must  be  reflected  by  an  exactly  equal  anoma- 
lous free  energy  of  ionization  of  the  groups  concerned.  Whether  such 
anomahes  in  fact  occur,  is  debatable.  If  one  assumes  a  uniform  density  of 
charged  sites  on  the  surface  of  the  protein  molecule,  then  anomalies  certainly 
exist. ^2. 86  jf^  Qjj  ^jje  other  hand,  one  allows  for  reasonable  variation  from 
one  protein  to  another  in  the  relative  locations  of  charged  sites,  then  such 
apparent  anomahes  can  be  explained  on  the  basis  of  electrostatic  interaction 
of  these  sites  with  the  acidic  or  basic  groups  under  consideration.^"  In  any 
event,  any  instabihty  contributed  by  hydrogen  bond  rupture  would  be  an 
addition  to  and  not  a  substitute  for  the  instabihty  due  to  electrostatic  forces. 

It  should  be  noted  again,  however,  that,  though  Coulombic  forces  prob- 
ably provide  the  principal  basis  for  the  thermodynamic  instability  of  a  com- 
pact configuration,  hydrogen  bonds  may  block  the  kinetic  pathway  to  a 
configurational  change.  It  is  not  impossible  to  visualize  mechanisms  by 
which  a  change  in  configuration  is  acid-  or  base-catalysed  so  that  a  change 
in  pH  per  se  may  affect  the  rate  at  which  a  change  in  configuration  can  occur. 

Before  considering  the  effect  of  pH  on  globular  proteins,  it  is  of  interest 
to  examine  its  effect  on  a  simple  synthetic  polyelectrolyte,  such  as  poly- 
methacryhc  acid.  This  polymer  is  isoelectric  when  all  of  its  carboxyl  groups 
are  un-ionized.  It  then  has  no  charges  at  all,  and  is  randomly  coiled.  When 
it  becomes  charged,  by  ionization  of  the  carboxyl  groups,  Coulombic  repul- 
sion sets  in.  Relatively  compact  configurations,  such  as  that  represented  by 
Fig.  \b,  acquire  a  much  higher  electrostatic  interaction  energy  than  relatively 
extended  configurations,  such  as  that  represented  by  Fig.  \d.  The  average 
configuration  therefore  becomes  much  more  extended,  and  this  is  reflected 
by  corresponding  changes  in  viscosity,  sedimentation  coefficient,  etc.  (A 
theoretical  treatment  of  this  effect  has  recently  been  presented  by  Harris 
and  Rice.^*'^'  The  same  authors  have  also  considered  polyampholytes  with 
equal  numbers  of  acidic  and  basic  groups. ^^)  Experimental  verification  of 
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this  effect  is  provided  by  the  viscosity  data  of  Arnold  and  Overbeek^  for 
polymethacrylic  acid,  which  are  shown  in  Fig.  3. 

An  increase  in  ionic  strength  has  a  much  greater  effect  on  electrostatic 
interactions  in  configurations  in  which  the  molecular  domain  is  permeated 
by  solvent  than  it  does  for  the  impenetrable  configuration  to  which  the  cal- 
culation of  Table  2  applies.  As  a  result,  a  moderate  ionic  strength  suffices 
to  eliminate  electrostatic  forces  almost  entirely  with  a  resultant  return  to 
essentially  the  randomly  coiled  configuration  of  the  uncharged  molecule. 
This  effect  is  also  illustrated  by  Fig.  3. 
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Fig.  3.  Reduced  viscosities  in  dilute  solutions  of  polymethacrylic  acid  (Arnold  and 
Overbeek^).  Ionic  strength  is  0010  to  0-017  for  the  top  curve,  0-10  for  the  middle  curve, 
and  10  for  the  lowest  one.  The  concentration  c  was  000085  g./cc.  in  all  experiments. 

Exactly  parallel  behavior  on  the  part  of  globular  proteins  is,  of  course, 
not  to  be  expected.  Isoelectric  polymethacrylic  acid  is  randomly  coiled,  i.e. 
all  possible  configurations  have  approximately  the  same  energy.  Thus,  even 
small  energy  changes  in  the  more  compact  configurations  are  immediately 
(and  reversibly)  reflected  in  an  increase  in  the  proportion  of  molecules  with 
more  extended  configurations.  In  the  globular  proteins  here  under  discus- 
sion, on  the  other  hand,  the  various  possible  configurations  in  the  isoelectric 
state  do  not  have  identical  energy.  Instead,  the  most  compact  configuration 
is  stabiHzed  by  what  may  be  a  large  energy  inherent  in  the  various  forms 
of  intramolecular  attraction  described  earlier  in  this  paper.  The  effect  of 
titration  is  to  make  the  electrostatic  contribution  to  the  free  energy  a  posi- 
tive one  (and,  perhaps,  to  change  the  strength  of  specific  bonds  between 
side  chain  groups).  Only  when  the  resulting  change  in  free  energy  becomes 
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sufficiently  large  to  overcome  the  energy  initially  stabilizing  a  compact  form 
will  transition  to  an  extended  randomly  coiled  configuration  become  pos- 
sible. When  a  transition  does  occur,  the  configurational  change  will  be  much 
more  drastic  than  the  shift  in  average  configuration  which  takes  place  in 
simple  polyelectrolytes,  with  the  result  that  the  transition  may  be  a  slow 
process  and  may  not  occur  under  reversible  conditions.  Another  difference 
between  the  anticipated  behavior  of  globular  proteins  and  the  behavior 
of  simple  polyelectrolytes  is  that  the  former  may  be  prevented  by  the  pre- 
sence of  disulfide  bonds  from  acquiring  configurations  as  extended  as  those 
of  simple  polyelectrolytes. 

It  has  been  noted  already  that  the  stabilization  of  a  compact  configuration 
for  isoelectric  proteins  depends  markedly  on  the  arrangement  of  charged 
sites,  and  that  disulfide  bonds  may  have  a  stabiUzing  effect  in  some  proteins 
and  not  in  others.  'Hydrophobic'  bonding  is  clearly  subject  to  similar  varia- 
tion, and  so  is  any  stabilization  energy  which  may  result  from  hydrogen 
bonding  or  intramolecular  ion  pairs.  It  is  clear  that  some  proteins  may  be 
expected  to  withstand  a  much  higher  net  charge  without  configurational 
change  than  others,  and  this  proves  indeed  to  be  the  case. 

Ribonuclease^"  is  an  example  of  a  protein  with  unusual  stability  in  this 
regard.  Its  intrinsic  viscosity  (Fig.  7)  remains  essentially  equal  to  that  ob- 
served at  the  isoelectric  point  from  pH  2  to  pH  11-5,  and  in  this  range  it 
is  essentially  independent  of  ionic  strength.  The  small  changes  (<10%)  in 
[rj]  which  do  occur  are  of  the  order  of  magnitude  to  be  expected  for  the 
effect  of  charge  ('electroviscous  effect')  on  the  viscosity  of  rigid  impenetrable 
spheres.^ 

We  shall  now  list  some  globular  proteins  which  are  known  to  undergo 
a  major  change  in  configuration  as  a  result  of  pH  changes.  Many  of  the 
examples  given  are  well-known  instances  of  acid-  or  base-induced  'denatura- 
tion'.  The  most  thoroughly  studied  example  (serum  albumin),  however, 
represents  a  very  rapid  and  reversible  reaction  which  would  not  ordinarily 
be  classified  as  a  'denaturation'.  We  shall  not,  on  the  basis  of  the  data 
available,  be  able  to  shed  much  light  on  the  forces  maintaining  the  initially 
compact  configurations.  We  shall  be  able,  however,  to  demonstrate  that  the 
product  of  the  pH-induced  configurational  change,  in  every  instance  which 
has  been  subjected  to  experimental  test,  is,  as  anticipated,  a  flexible  molecule 
exhibiting  behavior  characteristic  of  simple  linear  polyelectrolytes. 

Two  criteria  of  such  behavior  will  be  used.  The  first  is  the  expansion 
of  the  molecular  domain  with  increasing  net  charge  and/or  the  contraction, 
at  constant  charge,  with  increasing  ionic  strength.  This  behavior  is  illus- 
trated by  the  viscosity  data  for  polymethacrylic  acid  shown  in  Fig.  3.  Sedi- 
mentation or  diffusion  coefficients  provide  the  same  information. 

The  second  criterion  is  based  on  the  titration  curve.  In  particular,  we  shall 
examine  the  dependence  on  net  charge  of  the  quantity  pH— log  a/(l— a), 
in  which  a  represents  the  experimentally  observed  degree  of  dissociation  of 
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a  particular  kind  of  group  on  the  molecule.  This  quantity  is  an  effective 
pK  for  the  hydrogen  ion  dissociation  of  the  kind  of  group  being  considered, 
and,  at  constant  ionic  strength  and  temperature,  it  would  be  a  constant 
quantity  independent  of  net  charge  if  there  were  no  interaction  between 
the  many  acidic  and  basic  sites  present  on  a  protein  or  simple  polyelectro- 
lyte  molecule.  Such  interaction  does  in  fact  exist,  however,  as  a  result  of 
ordinary  Coulombic  forces  between  electrostatic  charges. 

Consider  first  a  globular  protein  in  its  compact  impenetrable  configura- 
tion, or  any  other  large  molecule  in  which  the  distances  between  sites  and 
the  penetrability  by  solvent  do  not  change  during  the  course  of  titration. 
Then  each  proton  added  to  the  molecule  will  provide  a  repulsive  force  for 
all  subsequent  protons,  and  each  proton  removed  will  have  the  opposite 
eff'ect.  Thus  pH— loga/(l— a)  will  decrease  with  increasing  net  positive 
charge  and  increase  with  increasing  net  negative  charge.^*^  The  effect  will 
be  greatest  at  low  ionic  strengths,  but  for  impenetrable  configurations  will 
persist  to  high  ionic  strengths.  The  resulting  behavior  is  illustrated  by 
Fig.  5,  which  is  obtained  from  the  titration  of  carboxyl  groups  of  ribonu- 
clease,''^  a  protein  known  from  viscosity  measurements  to  remain  compact 
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Fig.  4.  Logarithmic  plot  for  titration  of  the  free  phenolic  groups  of  ribonuclease  (Tan- 
ford,  Hauenstein  and  Rands^^). 


and  impenetrable  throughout  the  pH  range  under  consideration,  and  by 
Fig.  4,  obtained  from  the  titration  of  the  three  surface  phenolic  groups  of 
the  same  protein. ^^ 

For  simple  flexible  polyelectrolytes  the  effect  of  Coulombic  interaction  is 
quite  different.  The  first  few  protons  added  or  removed  exert  the  expected 
effect.  But  each  increment  of  net  charge  results,  as  we  have  seen,  in  a  greater 
extension  of  the  molecular  domain  and  this  in  turn  leads  to  an  increase  in 
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the  distance  between  charges  and  to  the  presence  of  more  shielding  electro- 
lyte between  them.  The  typical  result,  illustrated  by  Fig.  6,  is  that  pH  — log 
a/(l  — a)  becomes  virtually  independent  of  net  charge.^ 

Two  additional  criteria  will  be  used  as  circumstantial  evidence  for  the 
absence  of  configurational  change.  These  are  optical  rotatory  power  and 
the  rotational  relaxation  time  of  bound  fluorescent  dye  molecules.  These 
properties  are  quite  sensitive  to  configuration,  although  they  do  not  lend 
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Fig.  5.  Logarithmic  plot  for  titration  of  the  carboxyl  groups  of  ribonuclease  (Tanford 
and  Hauenstein^^).  Ionic  strength  is  0-15  for  the  top  curve,  0-03  for  the  middle  curve,  and 
0  01  for  the  lowest  one.  The  dashed  lines  show  the  slopes  of  Fig.  4  for  corresponding  ionic 
strengths.  The  differences  (20  to  40%)  presumably  reflect  the  relative  positions  of  various 
kinds  of  charged  sites.®** 
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Fig.  6.  Logarithmic  plot  for  titration  of  the  carboxyl  groups  of  polymethacrylic  acid 
(Arnold  and  Overbeek^).  Ionic  strength  is  0010  to  001 7  for  the  top  curve,  0-10  for  the 
middle  curve,  and  1  -0  for  the  lowest  one. 
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themselves  easily  to  interpretation  of  the  kind  of  configurational  change 
which  is  taking  place.  No  change  in  these  properties,  however,  indicates 
that  no  major  configurational  change  at  all  is  occurring. 

Serum  Albumin.  Serum  albumin  undergoes  a  reversible  change  in  configura- 
tion near  pH  4,  most  clearly  demonstrated  by  the  electrophoretic  behavior 
near  this  pH.^  Below  this  pH  the  protein  shows  the  behavior  typical  of 
simple  polyelectrolytes,  as  demonstrated  by  the  effect  of  net  charge  and 
ionic  strength  on  viscosity,^^-^^^  shown  in  Fig.  7.  The  effect  of  charge  on 


Fig.  7.  Intrinsic  viscosity  of  ribonuclease  (Buzzell  and  Tanford^")  and  of  bovine  serum 
albumin  (Yang  and  Foster^i^^  Tanford,  Buzzell  et  a/^^.gs)  Ribonuclease  at  ionic  strengths 
ranging  from  002  to  0-25  (/^),  serum  albumin  at  ionic  strengths  001  (^),  003  (£))  and 
0-15  (Q)-  The  arrows  indicate  the  isoelectric  points  of  the  two  proteins. 


pH 

Fig.  8.  Sedimentation  (^)  and  diffusion  (Q)  coefficients  for  serum  albumin.  Sedimenta- 
tion coefficients  (Harrington  et  aP^)  at  ionic  strength  01,  diffusion  coefficients  (Champagne 
and  Sadron,!*  Kekwick^e)  at  ionic  strength  0-15,  except  that  at  pH  2-3,  which  is  at  ionic 
strength  033. 
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sedimentation^^  and  diffusion  coefficients^*-'*'*  lead  to  the  same  conclusion,* 
as  shown  by  Fig.  8.  The  effect  of  ionic  strength  on  the  sedimentation  coeffi- 
cient has  been  measured  at  one  pH,^^  and  it  also  supports  this  conclusion. 
Essentially  similar  data  have  been  obtained  by  several  other  workers. *-'-^°'^* 
The  titration  of  carboxyl  groups  below  pH  4  is  also  in  accord  with  poly- 
electrolyte-like  behavior.^** 

Serum  albumin  also  undergoes  a  reversible  change  in  configuration  on 
the  alkaline  side  of  the  isoelectric  point,  above  pH  11.  The  polyelectrolyte- 
behavior  of  the  product  is  indicated  strikingly  by  the  titration  data  of 
Fig.  9.^^  There  is  a  very  pronounced  increase  in  viscosity  near  the  same 
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Fig.  9.  Logarithmic  plot  for  titration  of  the  phenolic  groups  (upper  curve)  and  amino 
groups  (lower  curve)  of  bovine  serum  albumin  (Tanford,  Swanson  and  Shore^^). 

pH.^*  Both  the  acid  and  the  alkaline  reaction  are  accompanied  by  marked 
changes  in  fluorescence  polarization  (Fig.  10)^°^  and  in  optical  rotatory 
power.^12-^^ 

An  interesting  aspect  of  these  configurational  changes  is  that  they  are  not 
symmetrical  about  the  isoelectric  point.  The  acid  change  occurs  when  the 
net  charge  is  about  +10,  the  alkahne  change  occurs  at  a  net  charge  of  about 
—  50.  It  seems  improbable  that  the  electrostatic  interaction  energy  of  a 
unique  compact  configuration  could  be  sufficiently  asymmetric  about  the 
isoelectric  point  to  account  for  this  observation.  These  data  therefore  pro- 
vide one  of  the  strongest  arguments  in  favor  of  the  idea  that  intramolecular 

*  Needless  to  say,  a  change  in  sedimentation  coefficient  alone  or  in  diffusion  coefficient 
alone  need  not  signify  a  change  in  configuration,  but  could  reflect  instead,  a  change  in 
molecular  weight.  Combination  of  the  sedimentation  and  diffusion  data  of  Fig.  8  leads 
to  the  conclusion  that  no  change  in  molecular  weight  occurs. 
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Fig.  10.  Fluorescence  polarization  of  ovalbumin  (curve  A)  and  of  serum  albumin  (curve 
C)  (Weberio5). 

hydrogen  bonds  (or  ion  pairs)  between  side  chain  groups  contribute  to  the 
stabiHty  of  a  compact  configuration.  (Since  both  the  acid  and  alkaline  reac- 
tion are  reversible,  we  are  dealing  in  this  case  with  true  thermodynamic 
stabihty.)  However,  an  alternative  explanation  is  possible.  It  is  not  neces- 
sarily true  that  serum  albumin,  or  any  other  protein,  has  a  unique  compact 
configuration.  More  than  one  such  configuration  may  exist.  That  at  the  iso- 
electric point  and  acid  to  it  may  have  a  relatively  high  electrostatic  interaction 
energy.  On  the  alkaUne  side,  where  some  of  the  charge-bearing  side  chains 
are  different  from  sites  which  are  charged  on  the  acid  side,  it  may  be  possible 
for  the  molecule  to  rearrange  itself  to  an  alternate  compact  configuration  of 
lower  electrostatic  free  energy.  (A  transition  from  model  A  of  Fig.  2  to 
model  B  would  be  an  example  of  the  kind  of  transition  here  suggested.) 
As  we  shall  mention  later,  it  has  in  fact  often  been  suggested,  for  reasons 
quite  unconnected  with  the  present  discussion,  that  serum  albumin  has  the 
ability  to  undergo  configurational  changes  of  this  kind. 

Hemoglobin.  Hemoglobin  undergoes  a  major  configurational  change,  char- 
acterized by  a  change  in  absorption  spectrum  and  in  the  titration  curve, 
near  pH  3-5. '^-^^  The  reaction  is  relatively  slow,  becoming  much  faster  with 
decreasing  pH,  and  is  slowly  reversible.  The  reaction  is  accompanied  by 
an  increase  in  intrinsic  viscosity  (at  pH  3-5  and  ionic  strength  0-04)  from 
3-5  to  13-5  cc./gram.^^  This  is  presumptive  evidence  for  a  randomly-coiled 
configuration,  though  the  effect  of  charge  and  ionic  strength  must  be  deter- 
mined to  establish  this  conclusion.  The  titration  curve^^  on  the  alkaUne 
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side  (above  pH  10)  is  sufficiently  steep  to  suggest  that  a  similar  change  in 
configuration  occurs  there. 

A  detailed  study  of  the  kinetics  of  the  acid  reaction,  in  both  directions, 
has  been  made  by  Steinhardt  and  Zaiser.^"'^^^-^^^'^^'^  Combination  of  the 
data  establishes  the  pH-dependence  of  the  equilibrium  between  the  compact 
and  expanded  form.  This  information  is  more  complete  than  similar  informa- 
tion for  any  other  protein,  although  it  is  unfortunate,  in  view  of  the  probable 
importance  of  electrostatic  forces  as  a  driving  force  of  the  reaction,  that 
all  experimental  studies  have  been  performed  at  a  single  ionic  strength.  Both 
the  equilibrium  and  the  rate  constant  for  formation  of  the  expanded  con- 
figuration depend  strongly  on  pH,  the  former  being  roughly  proportional 
to  (H+)^,  the  latter  to  (H"'")^"^.  This  marked  dependence  on  pH  has  been 
used  as  basis  for  the  hypothesis  that  a  number  of  critical  hydrogen  bonds 
must  be  ruptured  to  initiate  the  reaction.  This  hypothesis,  however,  is  entirely 
speculative.  As  Kauzmann^^  has  shown,  kinetic  behavior  similar  to  that 
observed  for  hemoglobin  will  result  if  the  reaction  depends  solely  on  an 
increase  in  electrosiaiic  interaction  energy  consequent  upon  a  decrease  in 
pH.  Kauzmanns  argument  applies  with  equal  force  to  the  equilibrium  of 
the  react  on. 

Conalbumin.  Conalbumin  undergoes  a  major  change  in  configuration  below 
pH  4,^°^  which  results  in  loss  of  solubility  under  conditions  where  the  original 
form  of  the  molecule  is  soluble.  That  the  product  of  the  reaction  is  of  the 
simple  polyelectrolyte  type  is  shown  by  the  viscosity  and  sedimentation  data 
of  Phelps  and  Cann.^^  Whereas  the  compact  form  has  an  intrinsic  viscosity 
of  3-8  cc./gram.,  the  intrinsic  viscosity,  after  the  change  in  configuration, 
is  much  larger  and  strongly  dependent  on  ionic  strength.  At  pH  3-0,  [17]  = 
16-0,  11-0  and  8-4  cc./gram,  respectively,  at  ionic  strengths  0-02,  0-07  and 
0-10.  The  sedimentation  coefficient  decreases  with  increasing  charge,  as  is 
shown  by  Fig.  11,  and  the  decrease  is  partly  suppressed  by  ionic  strength. 

On  the  alkaline  side  of  the  isoelectric  point,  near  pH  10-5,  there  is  a 
break  in  the  logarithmic  plot  of  titration  data  for  phenolic  and  amino 
groups, ^*^^  which  is  much  like  the  similar  break  shown  for  serum  albumin 
in  Fig.  9.  The  net  charge  at  this  point  is  about  the  same  as  the  net  charge 
at  which  the  acid  reaction  begins. 

Wishnia^"^  has  made  a  detailed  study  of  the  kinetics  of  the  acid  reaction, 
both  in  the  forward  and  reverse  direction.  An  interesting  feature  is  that 
the  reverse  reaction  shows  a  pH  maximum. 

Pepsin.  Pepsin  undergoes  irreversible  loss  of  its  enzymatic  activity  between 
pH  6  and  7.'^^  Edelhoch^^  has  studied  the  molecular  change  which  accom- 
panies this  reaction.  It  appears  that  a  small  part  of  the  molecule  is  split  off. 
The  remaining  portion  clearly  shows  polyelectrolyte-like  behavior.  At  pH  6, 
the  value  of  -qspjc  for  a  dilute  solution  (i.e.  essentially  the  intrinsic  viscosity) 
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Fig.  11.  Sedimentation  coefficient  of  conalbumin  at  ionic  strength  01  (Phelps  and 
Cann62)_  Osmotic  pressure  measurements  over  the  same  pH  range  indicate  that  no  change 
in  molecular  weight  takes  place.  The  arrow  shows  the  location  of  the  isoelectric  point. 


o 
o 


Q. 


4.5 

y 

// 

4,0 
3.5 

y 

^ 

40 


20 
NET  CHARGE 


Fig.  12.  Logarithmic  plot  for  titration  of  the  carboxyl  groups  of  /3-Iactoglobulin 
(Cannan,  Palmer  and  Kibrick^^),  ionic  strength  is  0  27  for  the  upper  curve  and  0  069 
for  the  lower  curve. 


3]  CONFIGURATION  OF  GLOBULAR  PROTEINS  57 

is  about  3-5  cc./gram,  independent  of  ionic  strength.  At  pH  7,  on  tiie  other 
hand,  the  same  parameter  ranges  from  21-2  to  6-7  cc./gram  as  the  ionic 
strength  is  increased  from  0-01  to  0-55.  Corresponding  changes  were  observed 
in  sedimentation  and  diffusion  coefficients. 

A  kinetic  study  of  the  loss  of  enzymatic  activity  of  pepsin  was  performed 
by  Steinhardt'^  as  early  as  1937.  The  pH  dependence  was  used  as  evidence 
that  rupture  of  specific  bonds  triggers  the  reaction.  This  conclusion  is  sub- 
ject to  the  same  comment  as  was  made  above  for  hemoglobin. 

ß-Lactoglobulin.  jS-Lactoglobulin  undergoes  an  irreversible  reaction  above 
pH  9,  the  product  of  which  is  insoluble  at  the  isoelectric  point.  The  kinetics 
of  the  reaction  have  been  investigated  by  Groves,  Hipp  and  McMeekin.^^ 
That  the  product  is  of  the  simple  polyelectrolyte  type  is  indicated  by  the 
titration  data  for  the  phenolic  groups  shown  in  Fig.  13.^^ 
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Fig.  13.  Logarithmic  plot  for  titration  of  the  phenolic  groups  of  /3-lactoglobulin  (Tan- 
ford  and  Swanson^«)  at  ionic  strengths  0  08  (Q)  and  0-27  (^).  The  dashed  lines  show  the 
slopes  of  corresponding  plots  for  titration  of  carboxyl  groups  (Fig.  12).  The  difference 
in  slopes  is  roughly  a  factor  of  ten. 

j3-Lactoglobulin  is  another  protein  with  marked  asymmetry  about  the 
isoelectric  point.  The  titration  data  for  carboxyl  groups, ^^  shown  in  Fig.  12, 
are  characteristic  of  a  compact  protein  up  to  the  highest  positive  charges 
reached.  Viscosity  measurement  as  acid  as  pH  1-7  also  give  no  indication 
of  any  change  in  configuration. ^^^  In  this  protein  it  is  thus  certain  that 
electrostatic  forces  alone  cannot  account  for  the  effect  of  pH  on  con- 
figuration. 


y-Globulin.  Jirgensons*"  has  shown  that  the  viscosity  and  optical  rotatory 
power  of  y-globulin  depend  on  pH  much  like  the  corresponding  properties 
of  serum  albumin.  His  data  are  shown  in  Fig.  14.  It  should  be  noted  that 
the  isoelectric  point  of  y-globulin  is  considerably  higher  than  that  of  serum 
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albumin,  so  that  the  changes  indicated  by  Fig.  14  represent  behavior  which 
is  much  more  nearly  symmetrical  about  the  isoelectric  point. 

It  should  be  noted  that  the  expanded  configurations  of  all  of  the  proteins 
discussed  above,  though  they  behave  qualitatively  like  simple  polyelectro- 
lytes,  do  not  have  intrinsic  viscosities  as  large  as  those  of  simple  poly- 
electrolytes  of  similar  molecular  weight  and  charge.  This  result  is  to  be  expected, 
as  has  been  mentioned,  because  disulfide  cross-links  limit  the  expansion  which 
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Fig.  14.  Intrinsic  viscosity  (0)  and  specific  rotation  (Q)  of  y-globulin  at  ionic  strength 
0-1  (Jirgensons*").  This  protein  is  a  mixture  of  components  with  different  isoelectric  points, 
the  average  of  which  is  near  pH  7. 

protein  molecules  can  undergo.  It  is  important  to  point  out,  in  addition, 
that  it  is  virtually  impossible  to  distinguish  configurations  consisting  of  a 
mixture  of  compact  and  randomly  coiled  parts  from  a  configuration  which 
is  completely  random  except  for  cross-links.  In  other  words,  though  the 
measurements  cited  clearly  show  polyelectrolyte-like  properties,  this  does 
not  yet  provide  a  complete  picture  of  the  expanded  configurations. 

We  shall  list  next,  some  globular  proteins  which  are  much  more  resistant 
to  configurational  change  than  those  discussed  above. 


Ribonuclease.  In  contrast  to  the  proteins  just  discussed,  ribonuclease  is  highly 
resistant  to  configurational  change.  As  Fig.  7  shows,  its  intrinsic  viscosity 
is  virtually  independent  of  charge  and  ionic  strength  between  pH  1  and 
pH  11-5.^"  At  pH  11-5,  there  is  a  configurational  change  which  brings  three 
phenolic  groups  from  the  interior  of  the  molecule  to  the  surface.^^  Whether 
this  reaction  also  leads  to  expansion  is  not  known.  At  pH  1-0  and  ionic 
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Strength  0-15,  the  intrinsic  viscosity  is  4-5  cc./gram,  in  contrast  to  the  value 
of  3-5  cc./gram  observed  at  the  same  pH  at  ionic  strength  0-25.^"  At  pH  2 
(ionic  strength  unspecified)  there  is  also  a  change  in  absorption  spectrum,'^ 
the  steepness  of  which  with  respect  to  pH  suggests  that  a  configurational 
change  is  occurring.  It  is  thus  possible  that  ribonuclease  tends  to  acquire 
a  polyelectrolyte-Uke  configuration  at  the  extreme  acid  end  of  its  titration 
curve  if  the  ionic  strength  is  sufiiciently  low. 

The  release  of  phenohc  groups  by  ribonuclease,  which  occurs  at  or  above 
pH  11 '5,  does  not  take  place  under  reversible  conditions.  As  was  pointed 
out  earUer,  this  prevents  us  from  deciding  whether  the  stability  of  the  original 
compact  configuration  of  this  protein  represents  true  thermodynamic  sta- 
bility, or  whether  it  merely  reflects  a  requirement  for  a  larger  free  energy  of 
activation.  One  of  the  most  fruitful  approaches  to  an  understanding  of 
ribonuclease  and  other  proteins  which  change  configuration  under  irrever- 
sible conditions  would  be  a  search  for  conditions  under  which  a  reversal 
to  a  compact  configuration  occurs.  From  a  study  of  such  a  reversal,  one 
could  obtain  a  measure  of  the  true  thermodynamic  stabiHty  of  the  compact 
configuration. 

Ovalbumin.  Ovalbumin,  Hke  ribonuclease,  undergoes  no  major  configura- 
tional change  between  pH  2  and  pH  11-5.  This  is  shown  by  the  titration 
curve  of  Cannan  and  coworkers,  ^^  which  fits  the  compact  sphere  model 
throughout  this  range  of  pH,  and  by  the  absence  of  change  in  the  polariza- 
tion of  fluorescence,  shown  in  Fig.  10.^°^  The  viscosity  data  of  Bull^  are 
not  entirely  unequivocal,  because  tjs^/c  was  determined  at  each  pH  at  a 
single  concentration  and  not  extrapolated  to  give  the  intrinsic  viscosity.  At 
the  very  low  ionic  strengths  used  in  some  of  his  experiments,  this  creates 
considerable  uncertainty.  Within  this  uncertainty  the  results  confirm  the 
conclusion  reached  from  the  other  data  cited.  ^ 

Ovalbumin  behaves  Hke  ribonuclease  in  another  respect.  Its  phenofic 
groups,  initially  inaccessible  to  titration,  are  slowly  released  near  pH  12  or 
higher. ^^  Whether  this  process  is  accompanied  by  expansion  is  not  known. 

Insulin.  In  many  ways,  insufin  is  the  best  known  of  all  globular  proteins. 
However,  only  scant  information  is  available  about  the  effect  of  pH  on  its 
configuration.  There  is  certainly  no  configurational  change  in  acid  solutions, 
as  has  been  shown  by  viscosity  measurements  of  Yang  and  Foster.^^^  The 
titration  curve  in  acid  solution  is  anomalous,^*  but  the  anomaly  is  fully  ex- 
plained on  the  basis  of  molecular  weight  changes  which  this  protein  is  known 
to  undergo  (see  below). 

Lysozyme.  Lysozyme  is  another  protein  for  which  no  expansion  has  yet 
been  observed.  Steiner"  has  shown  that  the  polarization  of  fluorescence  of 
dye  molecules  bound  to  lysozyme  remains  unchanged  between  pH  3-5  and 
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pH  10.  Yang  and  Foster^^^  have  demonstrated  that  the  intrinsic  viscosity 
is  constant  between  pH  2-3  and  pH  7-5.  The  titration  curve  of  lysozyme^"**'^" 
shows  no  feature  suggestive  of  expansion  at  any  pH, 

Apart  from  the  major  configurational  changes  discussed  so  far,  there  is 
hkeUhood  of  the  occurrence  of  minor  changes  in  configuration.  There  may, 
for  example,  be  several  different  arrangements  of  the  polypeptide  chain  (or 
chains)  which  have  'hydrophobic'  regions  of  comparable  stability.  A  change 
in  pH  and  the  resultant  change  in  the  number  and  positions  of  charged  sites 
may  stabilize  one  such  form  at  the  expense  of  another,  A  similar  situation 
can  arise  if  hydrogen  bonds  contribute  to  the  stability  of  the  original  com- 
pact form.  The  rupture  of  some  of  these  bonds  by  titration  could  lead,  if 
the  net  charge  is  still  low,  to  a  new  compact  configuration  with  a  new  set 
of  hydrogen  bonds. 

There  are  small  changes  in  the  sedimentation  coeflücient  of  jS-lactoglobulin 
near  pH  5  and  pH  8,^^  which  might  reflect  such  minor  changes  in  configura- 
tion. A  similar  small  change  in  sedimentation  coefficient  occurs  for  oval- 
bumin, near  pH  4.^^  It  would  be  desirable  to  obtain  confirmatory  evidence 
of  these  changes  by  other  methods. 

Minor  configurational  changes  need  not  necessarily  be  accompanied  by 
changes  in  hydrodynamic  properties.  Klotz  and  coworkers,  for  example, 
have  observed  changes  in  the  ability  of  serum  albumin  to  bind  calcium^^ 
and  certain  anions^"  and  uncharged  molecules,^^  which  they  have  ascribed 
to  a  configurational  change.  The  change  is  observed  near  pH  7,  where  no 
change  in  hydrodynamic  properties  can  be  detected.^^  Karush^^  jj^s  sug- 
gested that  the  unusual  ability  of  serum  albumin  to  bind  ions  of  all  kinds  is 
evidence  for  quite  general  configurational  adaptability.  That  these  observa- 
tions based  on  binding  properties  reflect  configurational  changes  is  not  an 
unassailable  conclusion,  for  alternative  interpretations  are  possible. 

POLYMERIZATION    OF    PROTEINS 

This  paper  has  dealt  with  the  various  forces  which,  in  water  solution,  attract 
one  portion  of  a  protein  molecule  to  another  portion  of  the  same  molecule. 
It  is  obvious  that  the  same  forces  must  be  able  to  act  intermolecularly  and 
thus  bring  about  polymerization.  Whether  polymerization  will  actually  take 
place,  depends  on  the  extent  to  which  these  forces  have  been  able  to  be 
utilized  intramolecularly. 

Association  between  protein  molecules  in  their  compact  configurations 
occurs  in  some  proteins  and  not  in  others.  It  does  not  occur  in  ribonuclease,^^ 
the  molecular  weight  of  which  is  independent  of  pH  or  ionic  strength  be- 
tween pH  2  and  pH  11.  It  does  not  occur  in  serum  albumin. ^^  The  very  slow 
association' '^^-^^  which  this  protein  undergoes  is  observed  only  on  stand- 
ing over  a  period  of  days,  and  then  only  subsequent  to  loss  of  the  compact 
configuration.^^  Insulin,  on  the  other  hand,  is  polymerized  at  all  pH  values 


3]  CONFIGURATION  OF  GLOBULAR  PROTEINS  61 

in  aqueous  solution. ^"•^^•"  The  extent  of  polymerization  is  lowest  when 
the  molecule  bears  a  high  net  charge,  positive  or  negative.  It  increases  as 
the  isoelectric  point  is  approached,  and,  in  the  isoelectric  region  itself,  leads 
to  insolubility  of  the  protein.  This  is  clearly  the  expected  result  if  electro- 
static forces  predominate  in  the  pH-dependence  of  intermolecular  attraction. 
Hemoglobin^^  is  another  globular  protein  which  fits  this  picture.  It  exists 
as  a  dimer  near  its  isoelectric  point  and  dissociates  reversibly  as  the  pH  is 
lowered.  Dissociation  is  probably  complete  before  the  expansion  earlier 
referred  to  occurs. 

The  most  interesting  example  of  the  polymerization  of  a  compact  protein 
molecule  is  that  which  occurs  for  j8-lactoglobuHn.^^'^"2.io3  Experimental 
molecular  weights  for  this  protein,  as  observed  by  Townend  and  Timasheff, 
are  shown  in  Fig.  15.  The  most  interesting  feature  is  the  polymerization  of 
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Fig.  15.  The  molecular  weight  of  iS-lactoglobulin  near  25°  (dashed  line)  and  near  2° 
(solid  line)  (Townend  and  Timasheff  102,103) .  (j  25°  at  the  lowest  feasible  concentration, 
as  determined  by  the  Archibald  method;  Q  25°,  miscellaneous  observations;  0  2°,  at 
concentrations  of  2  to  3  %,  calculated  from  the  relative  areas  of  separate  sedimentation 
peaks  representing  species  of  molecular  weight  35,000  and  70,000.  The  arrow  locates  the 
isoelectric  point. 


the  dimer,  at  0°C,  which  occurs  entirely  on  one  side  of  the  isoelectric  point. 
These  observations,  coupled  with  the  earlier  observation  that  ^S-lactoglobulin 
becomes  randomly  coiled  on  the  alkaline  side  of  the  isoelectric  point,  and 
not  on  the  acid  side,  make  this  one  of  the  most  interesting  of  all  globular 
proteins. 

It  might  at  first  sight  seem  probable  that  globular  proteins  in  a  randomly 
coiled  configuration  should  be  more  liable  to  polymerization  than  the  same 
proteins  in  compact  configurations.  However,  the  forces,  such  as  electrostatic 
repulsion,  which  lead  to  the  preference  for  a  randomly  coiled  configuration, 
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will  also  operate  against  polymerization.*  An  interesting  situation  arises, 
however,  when  an  expanded  protein  is  suddenly  returned  to  near  its  iso- 
electric point.  The  randomly  coiled  configuration  is  now  no  longer  the 
stable  one,  but  return  to  a  compact  configuration  may  require  specific  coil- 
ing of  the  polypeptide  chains,  and,  as  a  result,  may  be  a  slow  reaction.  Under 
these  conditions  polymerization  may  compete  with  return  to  a  compact 
shape,  even  if  the  latter  is  thermodynamically  stable.  Insolubility  at  the  iso- 
electric point,  which  is  so  often  a  consequence  of  denaturation,  may  be 
explained  in  this  way.  The  configurational  changes  described  earher  for 
serum  albumin  and  hemoglobin  are  reversible,  so  that  return  to  a  compact 
shape  is  the  favoured  reaction  for  these  proteins.  In  view  of  Kauzmann's 
suggestion**  that  the  number  of  disulfide  bonds  may  determine  the  rate  of 
return  to  a  compact  configuration,  it  is  interesting  to  note  that,  of  these  two 
proteins,  serum  albumin  has  a  large  number  of  disulfide  bonds  (about  16), 
but  hemoglobin  has  none. 

CONCLUSION 

The  most  important  conclusion  which  can  be  drawn  from  this  paper  is  that 
our  ignorance  of  the  configuration  of  proteins  in  aqueous  solution  greatly 
exceeds  the  few  facts  which  have  been  established.  It  appears  possible  that 
the  final  answer  to  the  problems  which  have  been  posed  may  be  different 
for  each  of  the  proteins  which  have  been  discussed.  Moreover,  a  complete 
theory  will  have  to  account  not  only  for  the  properties  of  these  globular 
proteins,  but  it  must  also  explain  why  proteins  such  as  collagen  appear  to 
have  no  stable  compact  configuration  at  all  in  aqueous  solution. 
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über  die  Artspezifität  der  Proteinstruktur 
H.  TUPPY 

II.  Chemisches  Institut  der  Universität  Wien 


In  Organismen,  welche  verschiedenen  Arten,  Gattungen,  Ordnungen  usw. 
angehören,  werden  übereinstimmende  physiologische  und  biochemische  Lei- 
stungen in  der  Regel  von  Proteinen  nicht  gleicher,  aber  ähnlicher  Struktur 
vollbracht.  Mit  anderen,  der  biologischen  Terminologie  entlehnten  Worten: 
Analoge  Proteine,  d.h.  solche,  die  in  verschiedenartigen  Organismen  vor- 
kommend einander  in  funktioneller  Hinsicht  gleichen,  stehen  zueinander 
häufig  auch  im  Verhältnis  der  Homologie,  d.h.  einer  weitreichenden  struk- 
turellen Übereinstimmung,  die  auf  eine  genetische  Verwandtschaft  hinweist. 
Die  Tatsache,  dass  funktionsgleiche  Proteine  sich  im  Rahmen  der  Homologie 
voneinander  durch  spezifische  strukturelle  Merkmale  unterscheiden,  wird 
gewöhnlich  mit  dem  Terminus  Artspezifität  gekennzeichnet  (womit  jedoch 
nicht  präjudiziert  werden  soll,  dass  sich  die  Variationen  der  Proteinstruktur 
an  die  Einteilung  der  Organismen  in  Arten  hält). 

Während  die  Klassifikation  der  Organismen  ausschliesslich  nach  mor- 
phologischen Merkmalen,  nach  der  Homologie  des  anatomischen  Baues, 
erfolgt,  steht  bei  der  Beschreibung  und  Einteilung  der  physiologisch  wirk- 
samen Proteine  der  funktionelle  Aspekt  im  Vordergrund.  Dennoch  ist  auch 
bei  Proteinen  ein  'vergleichend-anatomisches'  Studium  von  Interesse.  Denn 
einerseits  kann  man  füglich  einen  Zusammenhang  suchen  zwischen  mehr 
oder  weniger  weitreichenden  Homologien  der  Proteinstruktur  und  engeren 
oder  ferneren  genetischen  Beziehungen  der  Organismen,  denen  die  unter- 
suchten homologen  Proteine  entstammen.  Andererseits  lässt  sich  von  der 
Kenntnis  der  Artspezifität  der  Proteine  ein  besseres  Verständnis  der  struk- 
turellen Voraussetzungen  ihrer  biologischen  Wirkungen  erhoffen:  dürften 
doch  nur  die  unveränderlichen  Strukturmerkmale  mit  der  gleichbleibenden 
biologischen  Wirkung  in  Zusammenhang  stehen,  nicht  jedoch  die  je  nach 
Herkunft  variablen. 

Nur  in  Ausnahmefällen  ist  die  Artspezifität  der  Proteinstruktur  leicht 
erkennbar,  wie  z.  B.  bei  Vertebraten-Hämoglobinen  durch  auffallende  Unter- 
schiede der  Kristallform  und  der  Löslichkeiten.  Im  allgemeinen  bedarf  die 
Feststellung  der  Artunterschiede  einer  eingehenderen  Prüfung.  Eine  seit  Jahr- 
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zehnten  erprobte  Methode  ist  die  immunologische:  Homologe  Proteine  sind 
(mit  gewissen  Ausnahmen)  spezifische  Antigene  und  ihre  Antigen-Spezifität 
ist  umso  ähnlicher,  je  näher  sie  einander  in  der  biologischen  Klassifizierung 
stehen.  Bisweilen  lassen  sich  Proteine  verschiedener  Herkunft  auch  durch 
eine  unterschiedliche  elektrophoretische  Beweglichkeit  unterschieden.  Seit- 
dem in  den  letzten  anderthalb  Jahrzehnten  die  Aminosäure-Analytik  einen 
so  hohen  Grad  der  Genauigkeit  erreicht  hat,  dass  sie  auch  die  geringfügigen 
Unterschiede  in  der  Aminosäure-Zusammensetzung  sehr  ähnhcher  Proteine 
eindeutig  feststellen  kann,  verdanken  wir  der  Baustein-Analyse  homologer 
Eiweisskörper  ebenfalls  wichtige  Hinweise.  Hier  sei  beispielsweise  die  von 
Harfenist  und  Craig  (1952  b)  ^°  am  Insulin  dreier  Säugetierarten  vorgenommene 
Aminosäurebestimmung  hervorgehoben;  diese  zeigte,  dass  bei  ansonsten 
gleicher  Zusammensetzung  der  homologen  Insuhnmoleküle  an  drei  ihrer  51 
Aminosäurebausteine  auf  Konto  der  Artspezifität  gehende  Verschiedenheiten 
nachweisbar  sind.  Besonders  hervorzuheben  ist  auch  die  analytische  Unter- 
suchung, die  Knight  (1947)  an  einer  Reihe  von  Pflanzenviren,  die  dem 
Tabakmosaikvirus  nahestehen,  vorgenommen  hat;  er  fand,  dass  einander 
in  pathologischer  Hinsicht  nahe  verwandte  Stämme  eine  identische  oder  sehr 
ähnliche,  biologisch  entfernter  verwandte  Stämme  hingegen  eine  deutlich 
verschiedene  Aminosäure-Zusammensetzung  besitzen.  Seine  Befunde  gaben 
der  Vermutung  Nahrung,  dass  sich  in  der  Proteinstruktur  engere  oder  ent- 
ferntere genetische  Beziehungen  widerspiegeln.  Noch  einen  Schritt  weiter 
führten  Feststellungen  der  Natur  der  in  den  Polypeptidketten  homologer 
Proteine  endständigen  Aminosäurereste;  bei  der  ersten  vergleichenden  End- 
gruppen-Untersuchung, die  von  Porter  und  Sanger  (1948)  vorgenommen 
worden  ist,  wurde  ebenfalls  der  ermutigende  Befund  erhoben,  dass  chemische 
Gleichheiten  und  Verschiedenheiten  bis  zu  einem  gewissen  Grade  mit  sero- 
logischen parallel  gingen  und  der  biologischen  Klassifizierung  der  Organis- 
men, von  denen  die  Hämoglobine  stammten,  entsprachen. 

Für  einen  genaueren  Einblick  in  die  Natur  der  Artspezifität  der  Pro- 
teinstruktur ist  jedoch  die  Bestimmung  der  Aminosäure-Reihenfolge  in 
Polypeptidketten  homologer  Proteine  erforderlich.  Solche  vergleichende 
Sequenz-Untersuchungen  sind  in  den  letzten  Jahren  an  einigen  Stellen  durch- 
geführt worden.  Hier  ist  vor  allem  die  detailHerte  Aufklärung  der  Artdif- 
ferenzen im  Insulin  fünf  verschiedener  Säugetier-Spezies  hervorzugeben,  die 
wir  Sanger  und  seinen  Mitarbeitern  verdanken  (Brown,  Sanger  und  Kitai, 
1955;  Harris,  Sanger  und  Naughton,  1956).  Diese  und  andere  Homologie- 
Studien  werden  zusammenfassend  besprochen,  nachdem  über  eigene  ver- 
gleichende Untersuchungen  an  Cytochrom-c  verschiedener  Herkunft  berichtet 
worden  ist. 

Cytochrom-c  ist  für  eine  chemische  Bearbeitung  der  Artspezifität  der 
Proteinstruktur  besonders  geeignet.  Denn  es  zeichnet  sich  (wie  keineswegs 
die  meisten  anderen  Eiweisstoff'e)  durch  ein  nahezu  ubiquitäres  Vorkommen 
in  der  Welt  der  Organismen  aus:  Von  den  niedersten  bis  zu  den  höchsten 
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Lebewesen,  in  Mikroorganismen,  in  Pflanzen  und  in  Tieren  ist  Cytochrom-c 
nachgewiesen  worden.  Bei  seiner  Auffindung  und  Isolierung  leisten  seine 
charakteristischen  Absorptionsbanden  unschätzbare  Hilfe.  Es  besitzt  ein 
niedriges  Molekulargewicht  (in  der  Grössenordnung  von  12000)  und  über- 
lebt dank  seiner  verhältnismässig  robusten  Konstitution  die  Unbilden  einer 
Gewinnung  und  Reinigung  leichter  als  andere,  empfindUchere  Proteine. 

Eine  besondere  und  vorteilhafte  EigentümUchkeit  der  Cytochrom-c- 
Struktur  besteht  darin,  dass  im  Hämoprotein-Molekül  die  prosthetische 
Gruppe  mit  dem  Eiweissanteil  durch  zwei  ziemhch  stabile  Thioätherbrücken 
verknüpft  ist.  Diese  überdauern  sogar  einen  hydrolytischen  Abbau  des  Pro- 
teins. Theorell  (1939)  sowie  Zeile  und  Meyer  (1939)  konnten  aus  sauren 
Hydrolysaten  des  Cytochroms-c  eine  als  Porphyrin-c  bezeichnete  Verbin- 
dung isolieren,  welche  man  sich  durch  Addition  der  Thiolgruppen  von  zwei 
Cysteinresten  an  die  Vinylseitenketien  in  Stellung  2  und  4  eines  Protopor- 
phyrins aufgebaut  denken  kann. 
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Im  intakten  Cytochrom-c  enthält  die  prosthetische  Gruppe  natürlich  noch 
Eisen  als  Zentralatom  und  die  beiden  porphyrin-gebundenen  Cysteinreste 
sind  mit  ihren  Amino-  und  Carboxylgruppen  peptidisch  in  die  Eiweisskom- 
ponente  des  Hämoproteins  eingefügt.  Man  kann  sich  diese  Verhältnisse 
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zunutze  machen  und  beim  Abbau  des  Cytochroms-c  die  gefärbte  pros- 
thetische Gruppe  als  Markierung  des  ihr  benachbarten  Teiles  des  Apopro- 
teins verwenden.  Bei  einem  Studium  der  Artunterschiede  in  Cytochrom-c 
verschiedener  Herkunft  bietet  die  Untersuchung  gerade  des  porphyrinnahen 
Teiles  ihrer  Aproproteine  noch  einen  weiteren  Vorteil;  sie  gibt  absolute 
Sicherheit,  dass  in  den  homologen  Molekülen  analoge  Regionen  vergUchen 
werden,  die  in  der  gleichen  sterischen  und  funktionellen  Beziehung  zur 
prosthetischen  Gruppe  stehen. 

Wenn  Cytochrom-c  mit  verdünnter  Schwefelsäure  schonend  hydrolysiert 
wird,  so  geht  der  Abbau  nicht  bis  zum  Porphyrin-c,  sondern  es  finden  sich 
im  Hydrolysat  auch  Peptide  des  Porphyrins-c,  Verbindungen  also,  in  denen 
mit  den  porphyrin-gebundenen  Cysteinresten  noch  andere  Aminosäuren 
verknüpft  sind.  Diese  gefärbten  Produkte  lassen  sich  von  den  im  Hydro- 
lysat vorkommenden  freien  Aminosäuren  und  porphyrinfreien  Peptiden 
durch  Adsorption  an  Talk  abtrennen.  Unterwirft  man  Porphyrin-c  und 
Porphyrin-c-Peptide  der  Silbersulfat-Behandlung,  welche  Paul  (1950)  mit 
Erfolg  zur  Spaltung  des  intakten  Cytochroms-c  in  Hämatoporphyrin  und 
Apoprotein  verwendet  hat,  so  werden  unter  Sprengung  der  Thioätherbrücken 
Cystein  und  Cysteinpeptide  in  Freiheit  gesetzt.  Oxydation  mit  Perameisen- 
säure verwandelt  diese  in  Cysteinsäure  und  Cysteinsäurepeptide,  welche 
durch  lonophorese  und  Papierchromatographie  voneinander  getrennt  werden 
können.  (Tuppy  und  Bodo,  1954  a).  Die  bei  der  Untersuchung  von  Rinder- 
Cytochrom-c  erhaltenen  und  identifizierten  Cysteinsäurepeptide  zeigten  (siehe 
Tabelle  1,  Teil  A),  dass  einer  der  beiden  Cysteinreste  des  Porphyrins-c  von 
Lysin  und  Alanin,  der  andere  von  Glutaminsäure  und  Histidin  flankiert  ist 
(Tuppy  und  Bodo,  1954  c). 

Ein  weiterer  Einblick  in  die  der  prosthetischen  Gruppe  benachbarte 
Region  des  Apoproteins  wurde  durch  einen  enzymatischen  Abbau  des 
Cytochroms-c  gewonnen  (Tuppy  und  Paléus,  1955).  Bei  der  Verdauung  des 
Hämopro teins  mit  Pepsin  entsteht,  wie  Tsou  schon  1951  gezeigt  hat,  ein 
niedrigermolekulares  Abbauprodukt,  welches  rot  gefärbt  ist  und  die  intakte 
prosthetische  Gruppe  des  Ferments  enthält.  Tsou's  'pepsinmodifiziertes 
Cytochrom-c'  war  jedoch  nicht  einheitlich;  es  bedurfte  noch  einer  Reihe 
von  Reinigungsoperationen  (Fällungen  mit  Ammonsulfat  und  Trichlores- 
sigsäure,  Ausflockung  beim  isoelektrischen  Punkt  und  Verteilungschromato- 
graphie auf  Hyflo-Supercel-Säulen),  bis  ein  wirklich  reines  Produkt  vorlag. 
Dieses  'Hämopeptid'  enthielt,  wie  eine  Aminosäureanalyse  nach  Stein  und 
Moore  ergab,  abgesehen  von  den  zwei  prophyrin-gebundenen  Cysteinresten, 
je  einen  Threonin-,  Alanin-,  Histidin-  und  Lysin-Rest,  zwei  Valin-  und  drei 
Glutaminsäure-Reste,  insgesamt  also  1 1  Aminosäurebausteine.  Diese  sind  in 
einer  einzigen  Polypeptidkette  angeordnet,  an  deren  Aminoende,  wie  mit 
der  DNP-Technik  (Sanger,  1945)  festgestellt  wurde,  ein  Valinrest  sitzt,  ge- 
folgt von  Glutaminsäure.  Vom  Aminoende  stammt  auch  ein  Tripeptid,  Val- 
Glu(NH2)-Lys,   welches   vom    Hämopeptid   bei   Inkubation   mit   Trypsin 
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abgespalten  wird.  Zur  Aufklärung  der  vollständigen  Aminosäuresequenz  im 
Hämopeptid  wurde  dieses  zuerst  mittels  Paul's  Silbersalzmethode  in  seine 
Hämatin-  und  seine  Polypeptid-Komponente  zerlegt;  in  letzterer  wurden 
mit  Perameisensäure  die  zwei  Cysteinreste  oxydiert,  mit  dem  Ergebnis,  dass 
wir  ein  zwei  Cysteinsäure-Reste  enthaltendes,  farbloses,  porphyrin-freies 
Undekapeptid  erhielten.  Dieses  wurde  sowohl  mit  Salzsäure  partiell  hydro- 
lysiert  als  auch  mit  Subtilisin  proteolytisch  gespalten;  in  beiden  Fällen  ent- 
standen eine  Reihe  von  Abbauprodukten,  aus  deren  Struktur  die  Anordnung 
der  1 1  Aminosäurereste  in  der  ungespaltenen  Polypeptidkette  ohne  weiteres 
erschlossen  werden  konnte  (Tabelle  1 ,  Teil  B). 

Wird  Cytochrom-c  statt  mit  Pepsin  mit  Trypsin  abgebaut,  so  erhält  man 
(nach  entsprechender  Reinigung)  ein  dem  oben  beschriebenen  Hämopeptid 
sehr  ähnliches  Spaltprodukt  (Tuppy  und  Bodo,  1954  b),  dessen  Struktur 
ebenfalls  durch  partielle  Säurehydrolyse  ermittelt  wurde  (Tabelle  1,  Teil  C). 
Der  Polypeptidanteil  besteht  hier  aus  nur  9  Aminosäureresten;  die  Nona- 
peptid-Sequenz  ist  auf  der  N-terminalen  Seite  um  3  Aminosäurereste  kürzer 
und  auf  der  C-terminalen  um  einen  Rest  länger  als  die  im  peptischen  Abbau- 
produkt vorgefundene  Undekapeptid-Sequenz. 

Da,  wie  schon  erläutert,  die  durch  die  prosthetische  Gruppe  markierte 
Region  des  Cytochrom-c-Apoproteins  ein  dankbares  Objekt  für  Homologie- 
Studien  ist,  wurde  sie  ausser  in  Rinder-Cytochrom-c  auch  in  zwei  anderen 
Säugetier-Cytochromen,  dem  des  Pferdes  und  des  Schweines,  untersucht. 
Tint  und  Reiss  (1950)  waren  auf  Grund  elektrophoretischer  Versuche  zur 
Auffassung  gelangt,  dass  die  Säugetier-Cytochrome  artspezifisch  seien.  Die 
von  uns  untersuchte,  auf  eine  Sequenz  von  10  Aminosäuren  beschränkte 
Region  war  jedoch  in  den  3  Cytochromen  vollständig  gleich  (Tuppy  und 
Bodo,  1954  c)  (Tabelle  2).  Die  nächste  Prüfung  betraf  die  peptischen  Abbau- 
produkte eines  Fisch-und  eines  Vogel-Cytochroms-c  (Tuppy  und  Paléus, 
1955).  In  dem  des  Lachses  erwies  sich  die  Aminosäuresequenz  ebenfalls 

Tabelle  2 

VERGLEICH  EINER   HOMOLOGEN  REGION  IN  CYTOCHROM-C 
VERSCHIEDENER   HERKUNFT 


Rind 
Pferd 
Schwein 
Lachs 
Huhn 
Seiden- 
spinner 
Hefe 


. . .  -Val-Glu(NH2)-Lys-CyS-Ala-Glu(NH2)-CyS-His-Thr-Val-Glu-Lys- 

-Lys-CyS-Ala-GIu(NH2)-CyS-His-Thr-Val-Glu-Lys- 

-Lys-CyS-Ala-Glu(NH2)-CyS-His-Thr-Val-Glu-Lys- 

. .  .-Val-Glu(NHo)-Lys-CyS-Ala-Glu(NH2)-CyS-His-Thr-VaI-Glu-  . . . 
. .  .-Val-Glu(NH2)-Lys-CyS-5('/--Glu(NH2)-CyS-His-Thr-Val-Glu-. . . . 


. .  .-Val-Glu(NH2)-/4/-g-CyS-Ala-Glu(NH2)-Cys-His-Thr-Val-Glu-. 
-Phe-Lys-Thr Arg-CyS-Glu-Leu- CyS-His-Thr-Val-Glu- . 


als  mit  der  in  Säugetier-Cytochromen  ermittelten  identisch,  in  Hühner- 
Cytochrom-c  hingegen  war  an  die  Stelle  eines  Alanin-Restes  ein  Serin-Rest 
getreten.  Ein  Invertebraten-Cytochrom-c,  das  des  Seidenspinners  Bombyx 
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mori  (Tuppy,  1957),  enthielt  zwischen  den  beiden  porphyrin-gebundenen 
Cy stein- Resten  wieder  einen  Alanin-Rest,  so  wie  die  Säugetier-Cytochrome; 
dafür  ersetzte  ein  Arginin-  den  bisher  angetroffenen  Lysin-Rest.  Eine  be- 
deutendere Abweichung  von  diesen  sehr  ähnlichen  Aminosäuresequenzen 
wurde  erst  beobachtet,  als  nicht  mehr  tierisches,  sondern  Hefe-Cytochrom-c 
zur  Untersuchung  kam  (Tuppy  und  Dus,  1957).  Aus  Bäckerhefe  konnten 
zwei  Hämoproteine  mit  Cytochrom-c-Aktivität  extrahiert  und  voneinander 
durch  Ionenaustauscher-Chromatographie  getrennt  werden.  Eines  von  ihnen, 
welches  im  Hefe-Extrakt  in  reichlicherer  Menge  vorkommt,  wurde  mit 
Pepsin  abgebaut;  die  unterste  Zeile  der  Tabelle  2  zeigt  die  Aminosäure- 
sequenz im  peptischen  Spaltprodukt. 

Die  untersuchten  homologen  Cytochrome,  ob  sie  nun  vom  Säugetier, 
Fisch,  Vogel,  Insekt  oder  sogar  von  Hefe  stammen,  sind  einander  spektro- 
skopisch äusserst  ähnlich;  ihre  Absorptionsbanden  im  sicht-baren  Teil  des 
Spektrums  sind  ununterscheidbar.  Auch  entsprechen  sie  einander  in  ihrer 
katalytischen  Wirksamkeit;  sie  sind  alle  imstande,  die  spezifische  Cytochrom- 
c-Funktion  im  Succinoxydase-System  des  Säugetiers  zu  erfüllen.  Offenbar  ist 
nicht  nur  die  prosthetische  Gruppe  im  engeren  Sinne  stets  die  gleiche,  son- 
dern es  bleiben  auch  die  Art  ihrer  Verknüpfung  mit  dem  Proteinanteil  und 
die  Natur  jener  Gruppen  des  Proteins,  die  dem  'aktiven  Zentrum'  zuzurech- 
nen sind,  von  Artunterschieden  unberührt. 

Zu  den  charakteristischen  Invariablen  der  aufgeklärten  homologen  Amino- 
säuresequenzen gehören  der  stets  gleiche  Abstand  der  beiden  porphyrin- 
gebundenen  Cysteinreste  und  das  benachbarte  Vorkommen  eines  Histidin- 
Restes.  Wie  ein  von  Ehrenberg  und  Theorell  (1955)  gebautes  Modell  zeigte, 
ist  die  Verknüpfung  der  zwei  Cysteinreste  der  Polypeptidkette  mit  den 
Vinylgruppen  der  prosthetischen  Gruppe  in  zwangloser  Weise  möglich, 
wenn  die  Cysteinreste  voneinander  durch  zwei  andere  Aminosäurereste  ge- 
trennt sind  und  die  Polypeptidkette  die  Form  einer  a-Helix  hat;  durch  diese 
sekundäre  Struktur  der  Polypeptidkette  kommt  der  Histidin-Rest  in  eine 
räumliche  Lage,  die  eine  Bindung  eines  seiner  Imidazol-StickstoflFatome  an 
das  Eisen  der  prosthetischen  Gruppe  geradezu  nahelegt.  Dieser  Histidinrest 
stellt  aller  Wahrscheinlichkeit  nach  einen  Teil  des  'aktiven  Zentrums'  des 
Cytochrom-c-Moleküls  dar;  er  ist  als  einer  von  jenen  zwei  Aminosäureresten 
der  Proteinkomponente  zu  betrachten,  die  mit  'häm-gebundenen'  basischen 
Gruppen  die  Hämochromogen-Natur  des  Cytochroms-c  bewirken  (Theorell 
und  Âkeson,  1941). — Ob  auch  noch  andere  in  allen  untersuchten  Cyto- 
chromen  als  gleich  befundene  Strukturelemente  (so  z.  B.  die  Sequenz  Thr- 
Val-Glu)  darum  invariabel  sind,  weil  sie  eine  besondere  funktionelle  Be- 
deutung besitzen,  ist  derzeit  nicht  zu  beantworten. 

Wenn  wir  von  Hefe-Cytochrom-c  absehen  und  vorerst  nur  die  unter- 
suchten Tier-Cytochrome  vergleichend  betrachten,  stellen  wir  fest,  dass  sich 
die  Artspezifität  der  Struktur  in  der  aufgeklärten  Polypeptidregion  auf  den 
Austausch  einzelner  Aminosäurereste  gegen  andere,  ähnhche  beschränkt 
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(Lysin  gegen  Arginin,  Alanin  gegen  Serin).  Gleichartige  Befunde  sind  von 
anderen  Autoren  an  anderen  homologen  Polypeptiden  und  Proteinen  erhoben 
worden.  Es  ist  ebenfalls  der  Erstaz  eines  Lysin-Restes  durch  einen  Arginin- 
Rest,  der  das  Vasopressin  des  Schweines  von  dem  des  Rindes  unterscheidet 
(Du  Vigneaud,  Lawler  und  Popenoe,  1953),  und  der  Austausch  eines  Alanins 
gegen  Serin  ist  einer  der  geringen  Unterschiede  zwischen  Schweine-  und 
Schaf-Corticotropin  (Bell,  1954;  Li,  Geschwind,  Cole,  Raacke,  Harris  und 
Dixon,  1955).  Drei  Paare  ähnlicher  Aminosäurereste,  Alanin  und  Threonin, 
Glycin  und  Serin,  Valin  und  Isoleucin,  vertreten  einander  in  den  von  Sanger 
und  Mitarbeitern  miteinander  verglichenen  fünf  homologen  Säugetier- 
Insulinen  (Tabelle  3)  (Harris,  Sanger  und  Naughton,  1956).  Das  alternative 


Tabelle  3 

ARTUNTERSCHIEDE  IN   INSULIN 

(NACH  HARRIS,   SANGER  UND 

NAUGHTON,   1956) 


Rind 

. .  .-CyS-Ala-Ser-Val-CyS-. . . 

Schwein 

. .  .-CyS-Thr-Ser-Ileu-CyS-. . . 

Schaf 

. .  .-CyS- Ala-Gly-Val-CyS-. . . 

Pferd 

. .  .-CyS-Thr-Gly-Ileu-CyS-. . . 

Wal 

. .  .-CyS-Thr-Ser-Ileu-CyS-. . . 

Vorkommen  von  Alanin  und  Threonin  ist  ausserdem  auch  in  homologen 
Serumalbuminen  (Thompson,  1954),  von  Valin  and  Isoleucin  im  Hypertensin 
des  Rindes  und  Pferdes  (Elliott  und  Peart,  1956;  Skeggs,  Lentz,  Kahn, 
Shumway  und  Woods,  1956),  von  Glycin  und  Serin  in  Hämoglobin  ver- 
schiedener Herkunft  (Ozawa  und  Satake,  1955)  beobachtet  worden;  in 
Hämoglobinen  wurde  auch  ein  Ersatz  von  Valin  durch  Methionin  und  von 
Glutaminsäure  durch  Asparaginsäure  gefunden  (Porter  und  Sanger,  1948; 
Ozawa  und  Satake,  1955).  Eine  der  Artspezifität  entsprechende  'Stammes- 
spezifität'  stellten  Niu  und  Fraenkel-Conrat  (1955)  in  der  Proteinkomponente 
des  Tabakmosaikvirus  fest;  in  vier  Stämmen  lautet  die  C-terminale  Sequenz 
Thr-Ser-Gly-Pro-Ala-Thr,  in  einem  fünften  kommen  anstelle  von  Serin  und 
Glycin  die  diesen  ähnlichen  Aminosäuren  Alanin  und  Threonin  vor. 

Wenn  wir  nun  die  aufgeklärte  Aminosäuresequenz  im  Hefe-Cytochrom-c 
mit  derjenigen  im  Cytochrom-c  verschiedener  Tiere  vergleichen,  so  sehen 
wir  an  mehreren  einander  entsprechenden  Stellen  der  Polypeptidketten 
unähnliche  Aminosäurereste  stehen.  Diese  Feststellung  trifft  z.  B.  auf  jene 
zwei  Aminosäuren  zu,  welche  die  beiden  porphyrin-gebundenen  Cystein- 
Reste  voneinander  trennen;  hier  finden  wir  Alanin  bzw.  Serin  durch  Glu- 
taminsäure ersetzt  und  Glutamin  durch  Leucin.  Für  eine  solche  gegenseitige 
Vertretung  unähnlicher  Aminosäuren  sind  bisher  nur  vereinzelte  andere  Bei- 
spiele beschrieben  worden.  In  zwei  homologen  Melanophoren-stimuherenden 
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Hormonen  ersetzen  einander  Serin  und  Glutaminsäure  (Geschwind,  Li  und 
Barnafi,  1957;  Harris  und  Roos,  1956),  in  zwei  homologen  corticotropen 
Hormonen  Alanin  und  Leucin  (Bell,  1954;  Li,  Geschwind,  Cole,  Raacke, 
Harris  und  Dixon,  1955). 

Ein  Ziel  solcher  vergleichender  Untersuchungen  wäre  die  Kenntnis  der 
Regeln,  welche  die  Abwandlung  der  Struktur  in  homologen  Proteinen  in 
Abhängigkeit  von  ihrer  biologischen  Stellung  bestimmen;  leider  lassen  sich 
derartige  Regeln,  wenn  es  solche  überhaupt  gibt,  aus  den  beschränkten  bisher 
erhobenen  Befunden  nicht  ableiten. 

Eine  wichtige  Frage  ist  die,  ob  kleine  Unterschiede  der  Aminosäuresequenz 
in  Proteinen,  etwa  Auswechslungen  einzelner  Aminosäurereste  gegen  ähnliche 
andere,  wie  sie  im  Rahmen  von  Artunterschieden  festgestellt  worden  sind, 
auch  innerhalb  derselben  Organismenart  auftreten  können.  Bei  der  Struktur- 
aufklärung des  Insulins  ist  kein  Hinweis  gefunden  worden,  dass  es  Alter- 
nativen der  Aminosäuresequenzen  in  den  Polypeptidketten  gäbe,  wodurch 
eine  Mikroheterogenität  der  primären  Proteinstruktur  zustande  käme  (Sanger 
und  Tuppy,  1951;  Sanger  und  Thompson,  1953).  Zwei  voneinander  durch 
Gegenstromverteilung  getrennte  Formen  des  Insulins  (Harfenist  und  Craig, 
1952  a)  unterscheiden  sich  nicht  in  der  Aminosäureanordnung,  sondern  nur 
durch  eine  Säureamidgruppe  (Harfenist,  1953).  Neuerdings  halten  jedoch 
Grassmann,  Strobel,  Hannig  und  DefiFner-Plöckl  (1956)  eine  Mikrohetero- 
genität der  Fraktion  A  des  Rinder-Insulins  (ein  alternatives  Vorkommen 
von  Isoleucin  und  Valin)  dennoch  für  möglich.  Die  Untersuchung  der 
porphyrin-nahen  Region  des  Cytochrom-c-Apoproteins  förderte,  wenigstens 
bisher,  keinen  intraspezifischen  Sequenzunterschied  zutage.  Auch  bei  der 
Strukturaufklärung  der  Polypeptid-Hormone  Oxytocin,  Vasopressin,  Corti- 
cotropin, Intermedin  und  Glukagon  wurden  diese  als  homogen  befunden. 

Andererseits  sind  in  letzter  Zeit  mehrere  Fälle  bekannt  geworden,  in  denen 
Proteine,  aus  Individuen  der  gleichen  Art  gewonnen,  in  zwei  oder  zahl- 
reicheren ähnlichen  Formen  auftreten.  Das  bekannteste  Beispiel  ist  das 
menschliche  Hämoglobin,  das  nicht  nur  in  einer  normalen  adulten  Form 
und  einer  fötalen  Form  existiert,  sondern  auch  in  einer  Reihe  abnormaler 
Formen  (Zusammenfassung  bei  Itano,  1956),  unter  denen  das  Sichelzellanämie- 
Hämoglobin  das  erstgefundene  (Pauling,  Itano,  Singer  und  Wells,  1949)  und 
bestuntersuchte  ist.  Die  Synthese  einiger  von  diesen  verschiedenen,  elektro- 
phoretisch  unterscheidbaren  Hämoglobinen  steht  unter  der  Kontrolle  alleler 
Gene.  Noch  ist  eine  genaue  strukturelle  Differenzierung  der  einzelnen  Hämo- 
globine  nicht  erfolgt,  durch  die  sicher  erwiesen  wäre,  ob  die  Verschiedenheit 
in  der  Aminosäuresequenz  oder  aber  in  der  sekundären  Struktur  der  Protein- 
moleküle liegt.  Die  vergleichende  Untersuchung  von  Ingram  (1956)  an  nor- 
malem und  an  Sichelzell-Hämoglobin  spricht  für  das  erstere  ;  die  Unterschiede 
scheinen  nur  sehr  gering  zu  sein  und  könnten  in  der  Grössenordnung  der  bei 
Artspezifitätsstudien  gefundenen  liegen. 
Vom  biologischen  Standpunkt  aus  erscheint  es  durchaus  befriedigend,  dass 
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ein  und  dasselbe  Protein  innerhalb  derselben  Spezies  in  zwei  oder  mehreren 
homologen  Formen  vorkommen  kann,  die  durch  Einzelheiten  der  Amino- 
säuresequenz unterschieden  sind.  Wenn  die  Proteinsynthese  genetisch  ge- 
steuert ist  und  wenn  eine  Änderung  der  Aminosäuresequenz,  wie  sie  im 
Rahmen  von  Artunterschieden  auftritt,  Ausdruck  und  Folge  einer  inter- 
spezifischen Verschiedenheit  von  Erbfaktoren  ist,  welche  die  Proteinsynthese 
kontrollieren,  so  lässt  sich  auch  bei  intraspezifischem  Vorkommen  ver- 
schiedener, für  die  Proteinsynthese  verantwortlicher  Erbfaktoren  (bei 
Heterozygotie  eines  Individuums  bzw.  Heterogenic  einer  Population  art- 
gleicher Individuen)  das  Auftreten  eines  Proteins  mit  Verschiedenheiten  in 
der  Aminosäuresequenz  erwarten.  In  einem  heterozygoten  Organismus  sollte 
ein  Protein  in  zwei  verschiedenen  Formen,  in  einer  heterogenen  Population 
in  ebensovielen  Formen  vorkommen  können,  als  Allele  des  betreffenden 
Erbfaktors  existieren. 

Ob  dem  wirklich  so  ist,  dass  sich  in  Änderungen  der  Aminosäureanordnung 
in  Proteinen — direkt  oder  eher  indirekt — mutative  Veränderungen  von  Genen 
widerspiegeln,  wird  die  Zukunft  zeigen.  Jedenfalls  aber  sichert  der  genetische 
und  entwicklungsgeschichtliche  Aspekt  den  experimentellen  Untersuchungen 
sowohl  der  intra-  als  auch  der  interspezifischen  Unterschiede  der  Protein- 
struktur ein  besonderes  Interesse. 
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Comparative  studies  in  the  field  of 
protein  microstructure 

F.   SORM 

Institute  of  Chemistry, 
Czechoslovak  Academy  of  Science,  Prague 


Work  in  the  field  of  protein  microstructure  is  at  present  being  directed  along 
two  fundamental  lines  :  the  first  of  these  is  characterized  by  efforts  to  deter- 
mine the  complete  structure  of  the  simplest  proteins;  work  along  the  second 
line  seeks  to  discover  general  regularities  in  protein  structure  on  the  basis 
of  comparative  studies  of  lower  fission  products  of  the  proteins.  Practical 
difficulties  at  present  limit  the  scope  of  the  first  line  of  research  to  the  higher 
natural  peptides  and  proteins  of  very  low  molecular  weight.  In  spite  of  these 
limitations,  a  number  of  notable  successes  has,  as  is  generally  known,  been 
achieved  in  this  field,  culminating  at  present  in  the  determination  of  the 
complete  structure  of  ribonuclease.^  The  exact  determination  of  the  amino 
acid  sequence  in  the  peptide  chains  of  the  simpler  proteins  and  of  natural 
peptides  undoubtedly  also  contributes  valuable  information  to  our  know- 
ledge of  the  general  laws  governing  protein  structure.  The  results  of  the 
work  carried  out  so  far  must,  however,  for  the  present  be  evaluated  with 
some  caution  in  this  respect,  since  the  compounds  concerned  are  mostly 
possessed  of  highly  specific  biological  activities  which  very  probably  reflect 
a  corresponding  specificity  of  chemical  structure.  For  these  reasons,  the 
second  fundamental  line  of  protein  research,  which  is  not  limited  by  the 
molecular  weight  of  the  materials  studied,  is  gaining  increasing  importance. 
The  aim  of  such  research  may  be  either  the  elucidation  of  regularities  in 
the  arrangement  of  amino  acids  within  the  protein  chains — a  line  hitherto 
pursued  mainly  in  work  on  the  fibrous  proteins — or  a  search  for  resemblances 
in  the  chemical  structure  of  proteins  related  in  their  biological  function  or 
origin.  The  search  for  such  resemblances  in  protein  structure  has  up  to  the 
present  more  particularly  involved  those  parts  of  the  peptide  chains  whose 
nature  makes  them  amenable  to  direct  comparison  at  the  present  level  of 
experimental  technique.  Such  is  the  case,  for  instance,  with  the  amino  acid 
groupings  terminating  the  peptide  chains,  which  may  be  exactly  determined 
by  identification  of  the  terminal  amino  acids  or  peptide  sequences,  A  second 
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approach  to  this  problem  involves  the  separation  from  partial  hydrolysates 
and  the  further  study  of  such  peptides  as  differ  markedly  in  some  manner 
from  the  other  peptides  in  the  hydrolysate,  and  which  may  by  virtue  of  these 
differences  be  isolated  as  a  separate  group.  A  very  interesting  contribution 
of  this  kind  is  the  work  of  Tuppy  who,  by  a  comparison  of  the  coloured 
peptides  obtained  by  the  partial  hydrolysis  of  cytochrome-c,  established  the 
identity  of  the  amino  acid  sequence  in  the  neighbourhood  of  the  point  of 
attachment  of  the  pigment  in  a  number  of  animal  species.^ 

In  our  laboratory  we  have  for  some  time  past  been  carrying  out  com- 
parative studies  on  the  microstructure  of  certain  proteins,^  in  particular  of 
chymotrypsinogen  and  trypsinogen,  and  of  the  haemoglobins  and  myoglobins 
of  various  animal  species.  In  this  work  we  concentrate  on  the  isolation  and 
comparison  of  lower  peptides  containing  amino  acids  which  constitute  only 
a  relatively  small  fraction  of  the  total  residue  number  of  the  proteins  studied 
and  which,  moreover,  are  easily  separated  from  the  other  peptides.  These 
conditions  are  met  particularly  in  the  case  of  peptides  containing  cysteic 
acid  (derived  from  cystine  and  cysteine  by  oxidation),  arginine,  lysine,  or 

Table  1 
FRACTIONATION    SCHEME    FOR    PEPTIDE    MIXTURES 


Partial  hydrolysate 


five-compartment  electrophoresis 


basic  peptides 


multi-  Dowex  50 

compartment 
electrophoresis 


(descending 

paper 

electrophoresis) 


paper  paper 

chromatography    chromatography 


neutral  peptides 


oxidation, 

five-compartment 

electrophoresis 


acidic  peptides 


cysteic  acid 
peptides 


neutral 
peptides 


paper  paper 

chromatography    chromatography 


individual 
peptides 


individual 
peptides 


individual 
peptides 


individual 
peptides 
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histidine.  For  practical  reasons  we  initially  confined  ourselves  to  the  analysis 
of  partial  acid  hydrolysates  in  which  about  50  per  cent,  of  the  peptide  bonds 
had  been  preserved  intact  and  which  consisted  of  relatively  low  molecular 
weight  peptides.  The  isolation  of  the  cysteic  acid  peptides  was  ai  first  car- 
ried out  by  means  of  ion  exchange  on  Amberlite  IRA-4B,  that  of  the  arginine 
peptides  on  Amberlite  IRA-400,  particularly  in  those  cases  where  only  a 
statistical  analysis  of  the  overall  composition  of  the  appropriate  peptide 
fractions  was  carried  out.  Later,  when  we  undertook  the  isolation  and 
identification  of  the  individual  peptides  contained  in  each  fraction,  we 
developed  a  fractionation  procedure  whose  essentials  are  set  out  schematic- 
ally in  Table  1. 

As  an  example  of  the  statistical  procedure  in  comparing  the  microstructure 
of  proteins,  some  of  our  results  on  the  serum  albumins  of  a  number  of 
species — man,  ox,  horse,  sheep,  and  duck — may  be  given. ^  As  is  shown  in 
Table  2,  the  amino  acid  composition  of  all  five  proteins  is  very  similar.  The 

Table  2 
AMINO    ACID    COMPOSITION    OF    SOME    SERUM    ALBUMINS 


Species 

Amino  acid 

ox 

man 

horse 

duck 

sheep 

Cysteic  acid 

6-5  ±0-3 

6-3  ±11 

7-6±0-6 

6-9±0-2 

6-1   ±0-28 

Lysine + 

histidine 

16-8  ±005 

15-3  ±0-19 

1705±018 

13-3±0-3 

17-1   ±0-28 

Arginine 

5-2  ±0-82 

6-25  ±0-30 

5-2  ±0-21 

5-4±016 

5-6  ±014 

Aspartic  acid 

10-9  ±0-3 

11-9  ±00 

10-2  ±01 

131±0-2 

11-6  ±0-25 

Serine 

4-2  ±0-12 

3-53±0-4 

4-18±0-22 

61±012 

4-3  ±0-12 

Glycine 

1-8  ±005 

l-62±005 

2-34±0-14 

2-5±0-14 

2-5  ±0-16 

Glutamic  acid 

16-5  ±0-3 

17-2  ±00 

16-65±0-48 

161±0-22 

15-33±0-36 

Threonine 

5-8  ±0-5 

3-93±0-36 

3-6  ±0-26 

41±0-12 

5-2  ±0-12 

Alanine 

6-25  ±0-08 

8-2  ±0-2 

7-85±0-45 

6-6±015 

6-7  ±007 

Proline 
Tyrosine 

4-7  ±00 
50  ±01 

4-6  ±00 

3-75±0-8 

4-3  ±00 
3 -88  ±002 

|lO-8±0-15 

|9-3  ±0-3 

Methionine 

0-8  ±003 

l-77±0-12 

— 

4-5±0-2 

10  ±0  09 

Valine 

5-9  ±0-35 

7-76±0-3 

40  ±0-23 

4-2±0-26 

6-8  ±0-23 

Phenylalanine 

6-6  ±1-02 

8-1   ±0-87 

8-25±0-77 

6-6±0-36 

90  ±0-57 

Leucine + 

isoleucine 

14-8  ±0-42 

15-18±0-5 

15-10±0-77 

ll-3±0-39 

14-6  ±0-26 

Methionine 

0-76 

M4 

0-46 

1-14 

0-81 

Tyrosine 

5-62 

5-20 

4-43 

5-66 

5-67 

Tryptophan 

1-43 

0-88 

1-08 

0-62 

Ml 

The  figures  denote  grams  of  amino  acid  per  100  g.  of  protein;  they  are  average  values 
from  8  independent  determinations  (6  in  the  case  of  glutamic  acid,  threonine,  methionine, 
vahne,  and  phenylalanine).  The  probable  error  is  indicated  after  each  value.  The  values 
for  methionine,  tyrosine  and  tryptophan  recorded  in  the  last  three  lines  were  obtained 
by  specific  colorimetric  methods. 
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quantitative  determination  of  the  individual  amino  acids  in  the  total  hydro- 
lysate  was  here,  as  in  all  other  analyses  recorded  in  this  paper,  carried  out 
by  photometric  measurement  on  paper  chromatograms  by  the  method  of 
B.  Keil,^  Table  3  shows  the  overall  amino  acid  composition  of  the  arginine 
peptide  fraction  isolated  from  the  partial  acid  hydrolysates  of  the  five  serum 
albumins  by  means  of  Amberlite  IRA-400.  The  table  reveals  marked  dif- 
ferences in  the  amounts  of  several  of  the  amino  acids,  evidently  connected 
with  differences  in  the  microstructure  of  the  individual  serum  albumins.  In 
particular,  the  conspicuously  high  threonine  content  of  the  peptide  fractions 
obtained  from  ox  and  sheep  albumin  may  be  noted,  as  well  as  the  very  low 
tyrosine  content  of  the  peptides  from  duck  serum  albumin.  This  last  feature 
is  particularly  characteristic  since  all  the  native  proteins  contain  about  the 
same  amount  of  tyrosine. 

Table  3 

AMINO  ACID  COMPOSITION  OF  TOTAL  HYDROLYSATES   OF  THE 
ARGININE  PEPTIDES  FROM  SOME  SERUM  ALBUMINS 


Species 

Amino  acid 

ox 

man 

horse 

duck 

sheep 

Cysteic  acid 

4-5 

5-3 

4-8 

7-4 

3-8 

Lysine +histidine 

9-2 

13-3 

10-5 

12-4 

9-7 

Arginine 

28-5 

29-9 

29-6 

301 

24-4 

Serine + glycine 

6-6 

4-2 

51 

6  0 

5-4 

Glutamic  acid+ 

+  threonine 

9-8 

2-8 

4-3 

4-1 

9-3 

Alanine 

5-2 

5  0 

5-9 

6-3 

60 

Proline 

11-5 

12-3 

13-2 

11-3 

110 

Tyrosine 

3-7 

4-5 

3-6 

0-4 

1-6 

Methionine + valine 

3-7 

5-3 

2-9 

3-3 

60 

Phenylalanine 

0-9 

1-6 

1-6 

0-9 

3-2 

Leucine + isoleucine 

16-2 

18-7 

18-3 

16-7 

19-8 

Our  main  concern  in  the  work  carried  out  so  far  has  been  with  the  com- 
parison of  chymotrypsinogen  and  trypsinogen.  These  two  proteins — both  of 
approximately  the  same  molecular  weight — are  formed  by  a  single  type  of 
pancreatic  cell,  resemble  each  other  in  their  enzymic  function,  and  can  be 
obtained  crystalline  in  a  state  of  high  purity.  In  choosing  these  proteins  we 
further  considered  that  the  results  of  our  work  might  contribute  to  the 
determination  of  their  full  structures — a  problem  which  will  doubtlessly 
receive  considerable  attention  in  the  near  future. 

The  amino  acid  composition  of  the  two  enzymogens  as  determined  in 
our  laboratory  is  set  out  in  Table  4,  which  reveals  considerable  differences 
between  the  two  proteins  in  this  respect.  Chymotrypsinogen  contains  much 
less  serine  than  trypsinogen  and  more  threonine  and  alanine.  The  Table 
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Table  4 

AMINO  ACID   COMPOSITION   (RESIDUE  NUMBERS) 

OF    BOVINE    CHYMOTRYPSINOGEN    AND 

TRYPSINOGEN 


Amino  acid 

Chymotrypsinogen 

Trypsinogen 

Alanine 

19 

13 

Glycine 

20 

21 

Valine 

20 

15 

Leucine 

16 

12 

Isoleucine 

9 

12 

Proline 

8 

7 

Phenylalanine 

6 

4 

Tyrosine 

4 

9 

Tryptophan 

6 

— 

Serine 

26 

38 

Threonine 

20 

(9-11) 

Cysteine  (half-cystine) 

9-10 

11-13 

Methionine 

(1) 

1 

Arginine 

4 

2 

Histidine 

2 

3 

Lysine 

12 

14 

Aspartic  acid 

19 

24 

Glutamic  acid 

10 

10 

Amide  N 

23 

(23  ±3) 

further  shows  that  the  residue  numbers  of  the  basic  amino  acids,  particularly 
of  arginine  and  histidine,  in  chymotrypsinogen  and  trypsinogen  (4  and  2, 
and  2  and  3  respectively)  are  well  suited  for  our  comparative  studies,  as  well 
as  the  relatively  low  number  of  cysteine  (half-cystine)  residues  (9-10  and 
11-13,  respectively).  At  the  outset  we  again  carried  out  a  statistical  com- 
parison of  the  amino  acid  composition  of  total  hydrolysates  of  the  cysteic 
acid  peptide  fractions  isolated  from  the  partial  hydrolysates  of  the  per- 
formic  acid — oxidized  proteins  by  means  of  Amberhte  IRA-4B.^  Table  5^ 
in  which  the  amino-acid  compositions  of  the  total  hydrolysates  of  several 
proteins  and  those  of  the  corresponding  cysteic  acid  peptides  are  set  out, 
indicates  that  the  degree  of  resemblance  between  the  amino  acid  composition 
of  hydrolysates  obtained  from  the  cysteic  acid  peptides  of  chymotrypsino- 
gen and  trypsinogen,  respectively,  is  about  the  same  as  that  between  the 
composition  of  the  two  whole  proteins.  At  the  same  time,  however,  the 
table  reveals  significant  differences  between  the  composition  of  hydrolysates 
of  the  cysteic  acid  peptides  of  the  two  enzymogens  on  the  one  hand,  and 
of  the  remaining  proteins  on  the  other. 

The  statistical  evaluation  of  the  composition  of  the  cysteic  acid  peptide 
fractions  naturally  gives  only  very  approximate  information  about  resem- 
blances in  the  structure  of  the  proteins  under  examination;  the  results  have 

Fps 


Table 
STATISTICAL    COMPARISON    OF    THE    AMINO    ACID    COMPOSITION    OF    SOME 


Chymotrypsino- 
gen 

Proteins 

Leu+ 
Ileu 

Val 

Ala 

Thr 

Glu 

Ser.  Gly, 
Met 

Asp 

% 

â 

% 

J 

% 

A 

% 

zJ 

% 

à 

% 

â 

% 

J 

S 

161 
13-4 
100 

10-1 
8-4 
100 

7-6 
6-3 
100 

11-4 
9-5 
100 

9  0 
7-5 
100 

18-2 
151 
100 

11-3 
9-4 
100 

Trypsinogen 

18-2 
14-9 
111 

11 

9  0 
7-3 
87 

13 

5-8 
4-7 
75 

25 

5-4 
4-4 
46 

54 

10-7 
8-7 
116 

16 

20-5 
16-7 
111 

11 

7i-6 
IM 
118 

18 

±104 

Serum   albumin 
bovine 

14-8 
131 
98 

2 

5-9 
5-2 
62 

38 

6-2 
5-5 
87 

13 

5-5 
51 
54 

46 

/5-5 
14-6 
195 

95 

6-8 
6  0 
40 

60 

70-9 
9-6 
102 

2 

±19-7 

Serum   albumin 
human 

12-7 
11-9 
89 

11 

7-7 
7-2 
86 

14 

6-2 
5-8 
92 

8 

5-0 
4-7 
49 

51 

17-4 
16-3 
218 

118 

6-6 
6-2 
41 

59 

10-4 
9-7 
103 

3 

±22  0 

Ovalbumin 

16-2 
14-8 
110 

10 

7-1 
6-5 
77 

23 

6-7 
61 
97 

3 

4-0 
3-6 
38 

62 

iö-5 
150 
200 

100 

16-4 
150 
99 

1 

9i 
8-5 
90 

10 

±186 

Insulin 

160 
141 
105 

5 

7-5 
6-9 
82 

18 

¥•5 
4  0 
63 

37 

2-7 
1-9 
20 

80 

18-6 
16-4 
218 

118 

9-5 
8-4 
56 

44 

6-8 
6  0 
64 

36 

±24-5 

Globin  bovine 

15-4 
14-4 
107 

7 

91 
8-5 
101 

1 

7-4 
6-9 
110 

10 

4-4 
41 
43 

57 

5-5 
7-9 
105 

5 

12-4 
11-6 
11 

23 

7Ö-Ö 
9-9 
105 

5 

±  9-7 

Fibrinogen    bo- 
vine 

11-9 
10-8 
81 

19 

41 
3-7 
44 

56 

i-7 

3-4 
54 

46 

61 
5-5 
58 

42 

14-5 
13-2 
176 

76 

15-2 
13-8 
91 

9 

13-1 
11-9 
126 

26 

±18  2 

In  the  columns  headed  %  the  italic  numbers  denote  the  number  of  grams  of  amino  acid  obtained  from 
lUU  g.  ot  protein,  the  numbers  m  roman  type  denote  the  same  results  recalculated  to  add  up  to  100%  and  the 
ngures  m  bold-face  type  denote  the  amount  of  the  amino  acid  expressed  as  a  percentage  of  the  amount  of  the 
same  ammo  acid  present  in  chymotrypsinogen.  In  the  columns  headed  'cm='  the  italic  figures  denote  the  area 


PROTEINS   AND   THE   CORRESPONDING   ACIDIC   PEPTIDE    FRACTIONS 


Acidic  Peptides 

Leu  +  lieu 

Val 

Ala 

Thr 

Glu 

Ser,  Gly, 
Met 

Asp 

cni- 

A 

cm- 

à 

cm' 

J 

cm' 

J 

cm' 

A 

cm' 

J 

cm' 

A 

S 

24-6 
89  0 
100 

13-6 
49-2 
100 

22-5 
81-4 
100 

15-3 
55-3 
100 

54-9 
198-4 
100 

55-5 
200-5 
100 

21-7 
78-5 
100 

29-5 
117-7 
132 

32 

11-0 
43-9 
89-2 

11 

20-2 
80-6 
99  1 

1 

10-5 
42-0 
76  0 

24 

48-0 

163  0 

82-2 

18 

52-8 
210-5 
105  0 

5 

22-2 

88-5 

1130 

13 

±  7-5 

22-4 
94-3 
106 

6 

7-5 
31-6 
64  1 

36 

18-4 
77-5 
95  2 

5 

11-6 
49-0 
88-5 

11 

73-2 
308-5 
155  5 

56 

16-2 
68-4 
341 

66 

18-4 
11-5 
98-8 

1 

±148 

20-7 
901 
101 

1 

11-5 

50-0 

101-5 

2 

25-2 
109-8 
135 

35 

7-0 
30-5 
54-2 

46 

76-4 
332-2 
167-2 

67 

77-5 
77-5 
38-6 

61 

24-7 
107-5 
137-0 

37 

±17-5 

191 
94-4 
106 

6 

20-# 
100-9 
205  0 

105 

19-8 

97-8 

120  1 

20 

5-5 
22-5 
40-7 

59 

56-2 
278-0 
140  0 

40 

26-6 

131-2 

65-5 

34 

18-0 

89-0 

1132 

13 

±12-8 

1-4 
18-2 
155 

55 

23-2 

77-5 

157-6 

58 

19-1 
63-8 
78-5 

21 

¥-0 
13-4 
24-2 

76 

104-6 
349-0 
1760 

76 

46-7 

156-0 

77-8 

22 

24-3 

81-2 

103  5 

4 

±21  1 

27-5 

5-0 

73  0 

27 

5J-6 
126-8 
257-5 

158 

22-5 
53-2 
65-4 

35 

15-4 
36-4 
65  8 

34 

105-3 
249-0 
125-5 

26 

54-0 

119-0 

59  4 

41 

39-5 
93-4 
119 

19 

±27-0 

4-5 

6  0 

85-4 

15 

33.1 

16-7 

208  0 

108 

19-4 

102-7 

74  0 

26 

12  0 
61  0 
67-2 

33 

37-2 
135 

35 

52-8 

268-0 

81-5 

18 

5i-5 
163-7 
135-2 

35 

±19  8 

r- 

under  the  peaks  of  the  photometric  curves,  the  figures  in  roman  type  the  same  results  referred  to  1000  as  the 
sum  of  all  areas,  and  the  numbers  in  bold-face  type  the  results  expressed  as  percentages  of  the  appropriate 
value  for  chymotrypsinogen.  The  columns  headed  A  list  the  differences  from  the  chymotrypsinogen  values, 
and  h  is  the  mean  deviation  from  the  chymotrypsinogen  values. 
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been  given  here  more  particularly  for  the  reason  that  this  method  might 
have  some  justification  in  the  comparison  of  highly  complex  proteins  where 
there  is  no  possibility  of  isolating  and  identifying  a  reasonable  proportion 
of  individual  peptides.  In  our  further  work  on  the  cysteic  acid  peptides  of 
chymotrypsinogen  and  trypsinogen*  we  undertook  the  isolation  and  identi- 
fication of  individual  peptides.  For  this  purpose  we  used  a  cysteic  peptide 
fraction  obtained  by  oxidizing  the  neutral  peptide  fraction,  isolated  by  elec- 
trophoresis in  a  five-compartment  apparatus,  with  performic  acid;  the 

Table  6 

CYSTEIC    ACID    PEPTIDES    ISOLATED    FROM    PARTIAL   ACID 
HYDROLYSATES     OF     BOVINE     CHYMOTRYPSINOGEN     AND 

TRYPSINOGEN 


Chymotrypsinogen 

Trypsinogen 

1 

his.cys 

his.cys 

2 

leu.(pro,  val).cys 
g]y.(leu,  pro,  val).cys 

val.cys 
gly.(pro,  val).cys 

3 

val.cys 
ala.val.cys 
(val,  cys,  glu) 

ser.cys 
cys.ala 
cys.ala.gly 

4 

thr.cys 

cys.val 
thr.cys,  val 
ser.thr.cys 

thr.cys 

5 

phe.cys 
cys.gly 
(phe,  cys).gly 

phe.cys 
cys.gly 

6 

gly.cys.ser 

ser.gly.cys 

7 

ileu.cys 
ileu.cys.ala 
cys.ala 

cys.Ieu 

8 

■  ser.cys 

cys.met 
ser.cys.met 
ser.(cys,  met,  ileu) 

(cys,  met,  lieu).ala 

9 

thr.asp.cys 

(ser,  glu).cys 

*  Actually,  trypsin  was  used  in  these  experiments  since  it  differs  from  the  enzymogen 
by  a  peptide  sequence  irrelevant  to  this  particular  study. 
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Strongly  acidic  portion  was  roughly  fractionated  on  Dowex  2  and  the  indi- 
vidual peptides  were  isolated  by  repeated  paper  chromatography.  The  in- 
dividual cysteic  acid  peptides  obtained  from  both  proteins  are  listed  in 
Table  6.'  It  is  evident  at  first  sight  that  our  partial  hydrolysates  of  trypsin 
yielded  a  far  smaller  number  of  peptides  than  the  partial  hydrolysates  of 
chymotrypsinogen;  this  may  be  due  to  differences  in  the  extent  of  hydro- 
lysis in  the  two  cases. 

It  is  further  evident  from  Table  6  that,  taking  in  conjunction  those  pep- 
tides which  are  certainly  or  probably  connected,  it  is  possible  to  make  out 
nine  different  sequences  for  the  immediate  environment  of  the  cysteic  acid 
residues  in  chymotrypsinogen;  this  corresponds  exactly  to  the  number  of 
cysteine  (half-cystine)  residues  thought  to  be  present  in  the  chymotrypsino- 
gen molecule.  It  may  also  be  seen  that,  although  the  number  of  cysteic  acid 
peptides  isolated  from  the  partial  hydrolysate  of  trypsin  is  much  smaller, 
their  structures  conform  very  closely  to  four  of  the  peptide  sequences  found 
in  the  case  of  chymotrypsinogen. 

For  the  isolation  of  the  peptides  of  arginine,  histidine  and  lysine  from  the 
partial  acid  hydrolysates  of  chymotrypsinogen  and  trypsin  we  used  the  basic 
peptide  fractions  obtained  in  a  five-compartment  electrophoretic  apparatus. 
These  were  further  subjected  to  electrophoresis  in  a  multi-compartment 
apparatus  under  conditions  permitting  the  direct  separation  of  a  weakly 
basic  fraction  composed  predominantly  of  histidine-containing  peptides,  and 
at  the  same  time  a  rough  fractionation  of  the  strongly  basic  peptides  of 
arginine  and  lysine.  The  final  isolation  of  individual  peptides  was  achieved 
by  repeated  paper  chromatography  in  several  solvent  systems. 

The  compositions  or  structures  of  the  individual  peptides  isolated  in  this 
manner  are  summarized  in  Table  7  f-  ^^  the  results  show  that  we  have  suc- 
ceeded in  establishing  the  amino  acid  sequence  in  the  immediate  vicinity 
of  two  of  the  four  arginine  residues  present  in  chymotrypsinogen,  and  con- 
firm a  third  sequence  (leu.ser.arg.ileu.)*  which  had  previously  been  estab- 
lished from  activation  studies.  The  amino-acid  sequence  about  the  fourth 
residue  is  probably  val.arg.asp  or  val.arg.glu,  since  we  found  bound  arginine 
to  occur  in  the  neutral  peptide  fractions  of  the  hydrolysate.  Of  the  dipeptides 
isolated  from  partial  hydrolysates  of  trypsin,  two  correspond  in  structure 
with  sequences  identified  in  chymotrypsinogen. 

Table  7  also  shows  the  structures  of  the  histidine  peptides  isolated  in  this 
work.^  Here  again  we  were  able,  to  a  fair  degree  of  certainty,  to  establish 
the  immediate  environment  of  both  the  histidine  residues  present  in  chymo- 
trypsinogen. The  peptides  isolated  from  trypsin  show  agreement  with 
chymotrypsinogen  only  for  one  of  the  three  histidine  residues;  it  further 
appears  that  the  sequence  phe.his  in  chymotrypsinogen  may  be  replaced  by 
tyr.his  in  trypsinogen. 

*  This  sequence  has  been  confirmed  by  enzymic  hydrolysis  just  before  this  paper  was 
given. 
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Table  7 

PEPTIDES   OF   ARGININE   AND    HISTIDINE    ISOLATED 
FROM    PARTIAL    ACID    HYDROLYSATES    OF    BOVINE 
CHYMOTRYPSINOGEN   AND   TRYPSINOGEN 


[5 


Chymotrypsinogen 

Trypsinogen 

ser.arg 

ser.arg 

1 

arg.ileu 
leu.ser.arg.ileu* 

ala.arg. 

1 

arg.leu 

2 

arg.  val 
ala.arg.val 
ala.(arg,  val,  lys) 

3 

thr.arg 
thr.arg.tyr 

4 

val.arg 

thr.(asp,  ala).arg 
(enzymic) 

2 

val.arg 
ser.(val,  arg).ala 

ala.his 

1 

ala.his 

1 

his.cys 
ala.his.cys 

his.cys 

2 

phe.his 
his.phe 

2 

tyr.his 

3 

pro.his 

*  This  sequence  has  been  confirmed  very  recently  by  enzymic  hydrolysis. 

Table  8  shows  that  there  is  also  a  marked  resemblance  in  the  composition 
and  structure  of  the  lysine-containing  peptides  of  the  two  proteins,  although 
in  this  case,  owing  to  the  large  number  of  lysine  residues  present  and  the 
small  number  of  peptides  isolated,  no  definite  conclusions  as  to  their  struc- 
tural similarities  can  be  drawn. 

A  general  review  of  our  preliminary  results  on  the  comparison  of  the 
microstructure  of  chymotrypsinogen  and  trypsinogen  first  of  all  shows  that 
our  findings  are  in  full  agreement  with  the  structural  fragments  so  far  de- 
tected in  these  proteins  by  other  workers.  The  extensive  structural  similarity 
estabhshed  for  a  number  of  peptides  at  defined  sites  in  the  molecules  of  the 
two  proteins  permits  the  conclusion  that  there  is  considerable  resemblance 
in  their  microstructures.  However,  the  molecules  must  necessarily  also  con- 
tain larger  peptide  fragments  which  diff"er  in  the  two  cases;  this  follows 
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Table  8 

PEPTIDES  OF  LYSINE  ISOLATED  FROM  PARTIAL 

ACID     HYDROLYSATES     OF     BOVINE     CHYMO- 

TRYPSINOGEN    AND    TRYPSINOGEN 


Chymotrypsinogen 

Trypsinogen 

lys.Iys 

gly,  lys 

gly.  lys 

(i)Ieu.lys 

ser.Iys 

ser.lys 

thr.Iys 

thr.lys 

val.Iys 

tyr.Iys 

lys.ala 

lys.(ala,  ileu) 

lys.ileu 

lys.leu 

lys.leu 

lys.pro 

lys.val 

lys.(val,  phe) 

lys,  cys 

lys,  cys 

lys,  glu 

lys,  glu 

lys,  glu,  ileu 

lys,  glu,  (i)leu 

lys,  leu,  ileu 

ser.(asp,  cys,  thr).lys 

(enzymic) 

particularly  from  the  small  number  of  different  peptide  sequences  involving 
cysteine^  detected  in  partial  hydrolysates  of  trypsinogen,  which  contrasts 
with  the  total  number  of  cysteine  (half-cystine)  residues  (11-13)  present  in 
this  protein.  Finally,  the  results  of  our  work  on  the  arginine  peptides  may 
prove  of  value  in  the  concluding  phases  of  the  full  structure  determination 
of  the  two  proteolytic  enzymes  since  they  establish — unambiguously  in  the 
case  of  chymotrypsinogen — the  amino  acid  sequence  at  those  very  sites  at 
which  fission  of  the  peptide  chains  takes  place  in  controlled  enzymic  hydro- 
lysis. 

I  should  further  like  to  give  an  account  of  some  of  our  preliminary  find- 
ings in  a  comparison  of  the  microstructure  of  two  haemoglobins  (horse  and 
hog) — proteins  where,  for  practical  reasons,  we  may  not  for  the  time  being 
expect  an  attack  on  the  total  structure.  In  this  work  we  have  so  far  con- 
centrated on  the  comparison  of  the  peptides  of  arginine^"  and  histidine^^ 
(of  which  the  haemoglobin  molecule  contains  14  and  36-38  residues,  respec- 
tively), and  partly  those  of  lysine.  The  isolation  of  the  basic  peptides  from 
partial  hydrolysates  of  the  globins  prepared  from  pure,  crystalline  samples 
of  the  haemoglobins  was  carried  out  by  a  combination  of  electrophoretic 
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methods,  ion  exchange  on  Dowex  50,  and  paper  chromatography.  Table  9 
shows  the  composition  or  structure  of  the  individual  basic  peptides  isolated. 
The  close  resemblance  between  the  structures  found  in  the  two  proteins  is 
quite  evident.  Although  the  isolation  of  both  the  arginine  and  the  histidine 
peptides  was  carried  out  with  particular  care  in  this  instance,  the  number 
of  individual  peptides  eventually  isolated  is  small  when  compared  with  the 
total  number  of  residues  of  the  two  basic  amino  acids  known  to  be  present 
in  the  haemoglobins.  Moreover,  no  evidence  was  found  for  the  presence  of 
histidine  or  arginine  in  the  neutral  or  acid  peptide  fractions.  Our  results 

Table  9 

PEPTIDES    OF   ARGININE,    HISTIDINE,    AND 
LYSINE  ISOLATED  FROM  PARTIAL  HYDRO- 
LYSATES     OF    HORSE    AND    HOG    HAEMO- 
GLOBIN 


horse 

hog 

ala.arg. 

ala.arg. 

leu.arg. 

phe.arg. 

phe.arg. 

ser,  arg. 

ser.arg. 

tyr.arg. 

tyr.arg. 

arg.leu 

arg.leu 

arg.phe 

arg.phe 

arg,  lys 

arg,  lys 

lys,  his,  arg 

ser.his 

ser.his 

ala.his 

ala.his 

leu. his 

leu.his 

his.ala 

his.ala 

his.lys 

his.lsy 

ser.(his,  lys) 

ser.(his,  lys) 

his.  (lys,  ala) 

his.(lys,  ala,  val) 

his,  val 

his,  lys,  gly 

gly.lys 

gly.lys 

ser.lys 

thr,  lys 

thr.Iys 

his.lys 

his.lys 

lys.ala 

lys.ala 

lys,  val 

lys.val 

lys,  leu 

lys,  leu 

ser.  (his,  lys) 

ser.  (his,  lys) 

lys,  ala,  val 

lys.(ala,  val) 

lys,  gly,  thr 

leu,  lys,  val 

lys,  his,  gly 

lys,  ser,  ala 
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thus  indicate  not  only  an  extensive  similarity  in  the  structure  of  the  two 
hemoglobins  examined,  but  also  either  the  repetition  of  certain  peptide 
sequences  involving  the  basic  amino  acids,  or,  more  probably,  the  exist- 
ence of  several  identical  units  in  the  molecules  of  these  proteins.  This  last 
assumption  is  borne  out  by  the  work  of  Reichmann  who  found  that  under 
certain  conditions  the  haemoglobin  molecule  may  dissociate  into  four  parts. ^^ 
In  this  review,  I  have  attempted  to  demonstrate  the  principles  on  which 
we  base  our  approach  to  the  problem  of  protein  microstructure  and  some 
of  the  first  results  of  this  approach. 
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Discussion  on  the  paper  of  Professor  Sorm 

B.  KEIL 


Allow  me  to  supplement  Professor  Sorm's  lecture  by  a  few  remarks  concern- 
ing methods  and  techniques.  In  the  work  discussed  by  Professor  Sorm  we 
have  been  using  a  number  of  procedures  worked  out  in  our  laboratories, 
in  addition  to  the  well-known  and  well-tried  methods  of  separation  and 
identification.  For  following  the  course  of  fractionation  on  Dowex  columns 
or  by  multi-compartment  electrophoresis  we  use  paper  electrophoretic  ana- 
lysis of  samples  in  place  of  the  usual  ninhydrin  colorimetry.  This  method 
gives  rapid  approximate  information  not  only  of  the  amount,  but  also 
of  the  nature  of  the  material  in  the  samples.  The  arrangement  used  for 
this  purpose  is  very  simple.^  A  sheet  of  Whatman  paper  is  loosely 
stretched  vertically  between  two  troughs  containing  pyridine  acetate  buffer 
of  a  low  and  defined  conductivity.  Voltages  as  high  as  1500  V,  corre- 
sponding to  a  potential  gradient  of  about  40  V/cm.,  can  be  appUed  without 
cooling  (Fig.  1). 

The  analytical  separation  is  very  uniform  ;  the  time  of  each  run  is  1  -0  to 
1-5  hours.  Up  to  200  mg.  of  a  peptide  mixture  can  be  resolved  on  a  single 
sheet  of  Whatman  No.  3  paper.  Sanger  has  used  a  four-compartment  elec- 
trophoretic apparatus  for  the  primary  fractionation  of  acid  hydrolysates  of 
insulin.  We  have  found  a  five-compartment  apparatus  most  convenient,  with 
formic  acid,  acetic  acid  and  ammonia  as  the  electrolytes.^  The  omission  of 
the  sulphuric  acid  and  the  introduction  of  the  fifth  compartment  removes 
the  danger  of  loss  or  contamination  of  the  material  at  the  electrodes.  The 
separation  of  partial  hydrolysates  into  basic,  acidic  and  neutral  peptide 
fractions  is  clear-cut,  as  may  be  seen  from  the  paper  electrophoretic  check 
of  the  individual  fractions. 

An  extension  of  the  membrane  system  of  electrophoresis  led  to  the  de- 
velopment in  our  laboratories  of  a  multi-compartment  apparatus  for  the 
fractionation  of  peptides  and  proteins.^  In  a  set-up  of  one  hundred  com- 
partments, with  water-cooling  and  pyridine  acetate  buffer,  and  working  at 
5000  V,  a  progressive  fractionation  of  peptide  mixtures  can  be  achieved 
practically  without  losses.  The  fractions  are  withdrawn  from  the  individual 
compartments  by  pipetting,  and  directly  lyophilized.  A  similar  arrangement 
of  twenty  compartments  with  paper  membranes  has  been  used  for  the  pre- 
parative fractionation  of  proteins  and  of  mixtures  of  nucleic  acids,  poly- 
saccharides and  proteins  (Fig.  2). 
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With  compartments  of  10  ml.  capacity  each,  up  to  1  g.  of  material  may 
be  fractionated. 

There  are  two  further  techniques  which  I  would  like  to  mention:  one  for 
the  quantitative  determination  of  amino  acids,  and  the  second  for  carrying 
out  dinitrophenylation  analysis. 

The  most  precise  method  for  the  quantitative  analysis  of  amino  acid 
mixtures  today  is,  of  course,  the  technique  of  Moore  and  Stein;  however, 
for  the  routine  quantitative  determination  of  the  amino  acid  composition 
of  peptide  fractions  we  have  worked  out  four  years  ago  a  simpler  pro- 
cedure.^ This  required,  first,  an  adaptation  of  the  paper-chromatographic 
technique  which  would  permit  the  separation  of  as  many  of  the  amino  acids 
as  possible  in  a  single  direction,  so  that  a  number  of  samples  might  be 
analysed  in  parallel  (Fig.  3). 

This  problem  was  essentially  solved  by  repeated  development  of  the 
chromatogram,  with  drying  in  between  each  solvent  run.  After  detection, 
the  chromatogram  is  photographed  on  cinefilm  and  the  blackening  of  the 
film  is  measured  using  a  precision  spectrographic  microphotometer.  A  blank 
reading  for  any  part  of  the  paper  is  readily  obtained  from  the  photographic 
record.  Since  the  photographs  are  taken  by  reflected  light  the  inhomogeneity 
of  the  paper  fibres  is  of  no  consequence.  The  same  technique  can  be  used 
for  example  for  the  direct  quantitative  determination  of  glutamic,  aspartic, 
or  cysteic  acid,  histidine,  etc.,  after  paper-electrophoretic  separation,  or,  in 
general,  for  the  estimation  of  any  compound  giving  a  coloured  spot.  In 
another  laboratory  of  our  Institute  this  technique  is  for  instance  being  used 
for  the  determination  of  elementary  sulphur.  The  areas  under  the  curves 
obtained — which  resemble  those  obtained  by  the  Moore  and  Stein  method 
— are  converted  graphically  into  equivalents  of  each  particular  compound. 

Finally,  a  few  words  about  the  dinitrophenylation  technique:  With  lower 
peptides  on  a  micro  scale,  i.e.  with  amounts  of  about  5-30  fj-g,  we  use  various 
modifications  of  the  following  procedure  :2 


The  peptide  is  dinitrophenylated  in  the  microflask  and  then  selectively 
retained  on  a  microcolumn  of  Zerolite  H,  consisting  of  about  20  yug.  of  the 
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ion  exchanger.  All  other  components  of  the  mixture,  including  unreacted 
peptide,  are  eluted.  The  DNP-peptide  is  then  desorbed  back  into  the  flask, 
evaporated  to  dryness  and,  after  removal  of  the  ion-exchanger,  hydrolysed 
in  situ.  The  DNP  amino  acid  is  adsorbed  on  a  fresh  batch  of  the  ion  ex- 
changer, the  free  amino  acids  being  eluted.  The  adsorbed  material  is  eluted 
back  into  the  flask,  the  dinitrophenol  is  removed  by  vacuum  sublimation 
and  the  residual  DNP  amino  acid  is  identified  by  paper  chromatography. 
This  procedure  eliminates  the  extraction  with  organic  solvents  and  the 
separation  into  water-soluble  and  ether-soluble  fractions.  The  technique, 
which  readily  lends  itself  to  work  with  whole  series  of  samples,  was  used 
for  most  of  the  peptides  mentioned  in  Professor  Sorm's  paper. 
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Zone  electrophoresis  and  chromatography 
as  methods  for  the  purification  and 
characterization  of  proteins 

A.  TISELIUS 

Biochemical  Institute,  Uppsala  University,  Sweden 


The  Commission  on  Proteins  was  created  at  the  lUPAC  Congress  in  New 
York  and  Washington  1951  and  had  as  its  first  task  to  investigate  the  possi- 
bility of  finding  a  standard  protein  preparation  which  would  meet  reason- 
able requirements  of  purity  and  homogeneity.  This  is  not  the  time  to  discuss 
the  very  valuable  work  of  this  commission.  I  think,  however,  that  I  am  right 
in  saying  that  the  task  proved  much  more  difficult  than  could  be  foreseen. 
Still,  I  believe  that  if  nothing  else  had  come  out  of  the  work  than  a  realiza- 
tion of  the  difficulty  of  defining  a  pure  protein,  this  would  have  been  well 
worth  the  eff"ort  spent. 

Some  may  perhaps  ask  how  far  we  should  go  in  our  demands  as  to  the 
purity  of  proteins,  and  if  there  is  not  a  danger  that  we  might  become  protein 
'purists',  interested  in  purity  for  its  own  sake.  Such  an  argument  neglects, 
however,  several  facts  of  importance  in  protein  research  to-day.  One  is  that 
the  very  character  of  the  modern  sequence  analysis  methods,  developed  by 
Sanger,  assumes  a  high  degree  of  homogeneity.  It  would  be  an  almost  hope- 
less task  to  fit  the  puzzle  of  peptide  fragments  together,  if  these  residues 
come  from  more  than  one  homogeneous  protein.  Secondly,  the  high  resolu- 
tion of  modern  methods  of  protein  separation  has  in  an  increasing  number 
of  cases  revealed  a  complexity  or  micro-heterogeneity  which  appears  to  be 
extremely  interesting.  I  am  thinking  now  not  only  of  the  various  patho- 
logical varieties  of  hemoglobin  which  were  first  demonstrated  by  Pauling, 
Itano,  Singer  and  Wells  (1949),  but  also  of  a  number  of  enzyme  proteins 
which  appear  to  consist  of  mixtures  of  closely  similar  but  distinctly  different 
components.  We  cannot  yet  judge  the  significance  of  these  results — for  ex- 
ample, how  these  differences  are  reflected  in  specificity  or  activity — but  this 
is  certainly  an  important  field  for  further  investigations.  In  some  cases  this 
variation  appears  to  be  genetically  controlled.  Besides  the  pathological 
hemoglobins  this  complexity  has  now  also  been  established  for  normal 
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proteins  (e.g.  the  sheep  hemoglobins  studied  by  Harris  and  Warren  (1956; 
see  also  Evans  et  al.,  1956)  and  the  haptoglobins  in  human  serum  studied 
by  Smithies  (1955),  Smithies  and  Walker  (1955)  and  by  Smithies  and  Poulik 
(1956). 

I  shall  confine  myself  here  to  a  discussion  of  some  recent  developments 
of  two  methods,  namely  electrophoresis  and  chromatography,  in  their  appli- 
cation to  the  separation  of  proteins  and  protein-like  substances.  Of  these 
methods  electrophoresis  has  long  been  known  as  a  particularly  efficient 
method  of  studying  the  homogeneity  of  proteins.  The  application  of  chroma- 
tography to  such  problems  is  more  recent,  but  the  last  few  years  have  given 
us  several  examples  demonstrating  that  the  specificity  of  this  method,  well 
known  from  its  appHcation  to  low-molecular  weight  substances,  is  also  quite 
striking  with  large  molecular  weight  material. 

In  electrophoresis  the  most  important  recent  advance  appears  to  me  to 
be  the  development  of  zone  methods  and  their  application  to  separation 
problems  on  a  macro  and  micro  scale.  Fig.  1  illustrates  schematically  the 
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Fig.  1 .  Schematic  representation  of  the  principal  difference  between  boundary  and  zone 
methods  of  separation. 


difference  in  principle  between  boundary  methods  (as  applied  in  the  standard 
moving  boundary  procedure)  and  zone  methods  in  the  separation  of  two 
substances,  A  and  B.  It  is  seen  that  the  zone  method  requires  a  supporting 
medium  to  stabilize  the  zones  against  gravity  convections.  Thus  the  experi- 
ments are  performed  in  columns  or  troughs  packed  with  cellulose,  starch, 
glass  powder  or  similar  materials,  or  in  filter  paper  strips  (paper  electro- 
phoresis). In  certain  cases  zone  experiments  have  been  performed  in 
gels  (especially  when  serological  reagents  are  to  be  apphed — 'Immuno- 
electrophoresis') or  in  density  gradients.  The  apparatus  required  is  rather 
simple.  As  an  example.  Fig.  2  shows  a  zone  column  electrophoresis  apparatus 
constructed  by  Porath  (1956,  1957  a,  b)  in  Uppsala. 

The  column,  which  is  surrounded  by  a  cooling  jacket,  is  packed  with 
cellulose  powder.  Different_^types  of  columns  may  be  used,  depending  upon 
the  desired  degree  of  resolution  and  the  capacity  required.  Quantities  of 
up  to  50  g  of  protein  have  been  separated  in  columns  of  this  type.  Fig.  3 
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Fig.  2.  Zone  column  electrophoresis  apparatus 
according  to  Porath  (1956).  The  packed  column 
is  seen  to  the  right,  surrounded  by  a  cooling 
mantle.  Below  are  the  electrode  vessels  with  a 
levelling  arrangement. 


shows  the  zone  electrophoresis  laboratory  in  the  Institute  of  Biochemistry, 
Uppsala,  with  several  columns  in  operation.  In  this  procedure  the  zones, 
after  having  been  separated  by  electrophoretic  migration,  are  eluted  from 
the  column  and  collected  in  a  standard  automatic  fraction  collector  of  the 
type  commonly  used  in  chromatographic  work.  This  has  the  advantage  that 
the  columns  may  be  used  many  times  without  repacking.  In  other  types  of 
apparatus  (see  especially  Kunkel,  1954)  flat  horizontal  troughs  are  used. 
This  makes  the  surface  of  the  supporting  medium  more  easily  accessible, 
so  that  staining  reactions  can  be  applied  for  the  localization  of  the  zones, 
which  may  then  be  isolated  by  cutting  the  trough  contents  accordingly.  This, 
however,  makes  it  necessary  to  pack  the  trough  anew  for  each  experiment. 
Experiments  in  gels  often  give  very  sharp  separations,  but  elution  is  diffi- 
cult. Paper  strip  electrophoresis  has  become  extremely  popular  on  account 
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of  its  simplicity  and  good  resolution,  and  has  found  wide  application  as 
a  micro-method.  To  a  certain  extent  its  relation  to  other  types  of  zone  electro- 
phoresis is  similar  to  that  of  paper  chromatography  to  column  chromato- 
graphy, but  the  heat  generated  by  the  current  may  give  rise  to  evaporation 
disturbances  which  make  the  method  less  suitable  for  quantitative  work. 

Recent  experiments  in  Uppsala  with  very  small  columns  show  that  quan- 
tities as  small  as  those  commonly  used  in  paper  strip  electrophoresis  may 
also  be  studied  in  columns  under  more  rigidly  controlled  conditions  than 
in  the  paper  strip  method. 

In  the  discussion  which  is  to  follow  Dr  Porath  will  give  some  examples  of 
the  application  of  the  zone  column  technique.  I  shall  limit  myself  to  a  few 
comments  on  the  advantages  and  limitations  of  the  zone  methods  in  general. 

The  advantages  of  the  zone  methods  over  the  classical  boundary  pro- 
cedure are  obvious,  especially  that  in  the  former  it  is  possible  to  achieve 
complete  separation  of  the  migrating  components  and  to  isolate  them  for 
further  investigation.  In  the  boundary  method  the  zones  overlap  partially 
and  conclusions  are  drawn  only  from  optical  observation  of  their  migra- 
tion. A  further  study  of  the  properties  and  diiference  in  the  characteristics 
of  components  is  difficult  or  (with  closely  similar  substances)  impossible. 
In  addition  the  zone  methods  are  much  easier  to  apply  to  low-molecular 
weight  compounds,  like  amino  acids  and  peptides.  Boundary  electrophoresis 
of  such  substances  has  proved  very  difficult  because  of  their  considerable 
contribution  to  the  conductivity  of  the  system,  which  results  in  boundary 
anomalies.  In  zone  electrophoresis  these  anomalies  are  apparently  much  less 
pronounced,  probably  because  of  the  easy  interchange  of  ions  by  diffusion 
between  the  front  and  the  rear  of  a  zone  and  by  a  gradual  automatic  adjust- 
ment of  the  concentration  to  lower  values.  A  further  advantage,  which  has 
been  demonstrated  by  Porath  (1957  b),  is  that  the  reduction  in  diffusion  in 
powdered  supporting  medium  is  much  greater  than  the  reduction  in  cur- 
rent; the  heat  generated  by  the  current  being  carried  away  by  the  stabilizer 
almost  as  efficiently  as  in  free  solution.  This  contributes  very  much  to  the 
separation  efficiency,  particularly  with  low-molecular  weight  substances. 

One  has  to  pay  a  price,  however,  for  all  these  advantages.  Most  materials 
suitable  for  use  as  supporting  media  show  some  interaction  with  proteins 
and  other  substances.  There  is  of  course  a  mechanical  hindrance  but,  above 
all,  specific  interaction  by  adsorption  is  very  common,  leading  to  a  spread- 
ing of  the  zones  by  taihng  and  influencing  their  mobilities  in  a  way  which 
is  difficult  to  predict.  This  is  the  key-problem  of  the  zone  method  and  much 
work  has  already  been  done  to  find  more  inert  supporting  materials  which 
would  show  as  small  interaction  as  possible  with  the  migrating  substances. 
Flodin  and  Kupke  (1956)  have  prepared  a  partially  esterified  cellulose  by 
ethanolysis,  which  has  proved  much  superior  to  ordinary  cellulose  for  zone 
electrophoresis  columns  and  is  now  generally  used  in  the  Uppsala  labora- 
tory for  protein  and  peptide  separations.  No  doubt  other  possibilities  will 


Fig.   3.    Zone   electrophoresis   laboratory    at    the    Institute    of 
Biochemistry,  Uppsala. 


Fig.  4.  Chromatographic  separation  of  R-phycoerythrin  and  R-phycocyanin  in  an 
extract  of  Ceramium  rubrum.  Calcium  phosphate  (Brushite)  column,  saturated  with 
0-005M  phosphate  buflfer,  pH  =  6-8.  Elution  of  phycoerythrin  (same  buffer,  0-02m,  ex- 
posures 1-2)  and  of  phycocyanin  (same  buflfer,  004m,  exposures  3-5).  (From  A.  Tiselius, 
S.  Hjertén  and  Ü.  Levin,  lb.  (1956).) 
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be  explored.  It  is  of  course  desirable  that  the  water  content  of  the  column 
is  as  high  as  possible.  In  this  respect  cellulose  has  advantages:  a  column 
packed  by  sedimentation  contains  approximately  85%  water,  as  compared 
with  40%  for  glass  powder  and  50%  in  starch,  of  which  only  33%  is  free 
(Porath,  1957  b). 

The  effects  mentioned  represent  a  difficulty  not  only  by  interfering  with 
the  separation  but  also  as  regards  the  evaluation  of  the  results  in  terms  of 
mobihty.  It  is  true,  of  course,  that  most  zone  electrophoresis  experiments 
are  made  with  the  separation  as  chief  purpose,  and  that  for  accurate  mobility 
determinations  the  free  moving  boundary  electrophoresis  method  is  pre- 
ferred. The  possibility  of  quantitative  control  of  the  separation  has,  how- 
ever, been  an  attractive  feature  of  the  electrophoretic  methods,  and  it  would 
be  a  pity  if  this  would  have  to  be  sacrificed  in  order  to  gain  the  advantages 
offered  by  the  zone  methods.  If  there  is  no  interaction  between  the  migrating 
substances  and  the  supporting  material  except  the  purely  mechanical  hind- 
rance (or,  strictly  speaking,  if  such  interaction  and  mechanical  hindrance 
influence  the  mobilities  of  all  substances  present,  including  the  buffer  ions, 
to  the  same  extent)  it  ought  to  be  possible  to  determine  mobilities  by  measur- 
ing the  volume  distances  between  the  peaks  in  the  elution  diagrams  obtained 
from  zone  electrophoresis  columns,  and  comparing  them  with  the  distance 
of  migration  of  a  substance  of  known  mobility.  Attempts  in  this  direction 
have  been  made,  but  I  believe  it  is  very  important  to  pursue  such  investiga- 
tions with  the  more  inert  supporting  materials  now  available. 

When  turning  now  to  protein  chromatography  I  wish  to  say  first  of  all 
that  this  field  is  much  more  recent  and  considerably  more  difficult  to  discuss. 
It  appears,  however,  as  if  the  application  of  chromatography  to  proteins 
of  large  molecular  weight  has  now  left  the  'lag  phase'  (to  use  an  expression 
from  bacteriology).  Whether  it  has  definitely  entered  a  'log  phase'  remains 
to  be  seen,  but  there  are  already  a  number  of  beautiful  results  which  indi- 
cate that  the  method  ofifers  great  possibilities.  It  is  equally  clear,  however, 
that  the  underlying  principles  are  not  yet  well  understood  and  that  the 
methods  involve  certain  risks  of  making  grave  mistakes. 

There  is  an  excellent  recent  review  of  this  whole  field  by  Moore  and  Stein 
(1956)  and  for  this  reason  I  shall  limit  myself  to  some  brief  remarks.  The 
particular  difficulties  ofifered  by  proteins  in  chromatography  depend  on 
their  large  molecular  size,  which  may  prevent  them  from  entering  into  the 
pores  of  the  column  material  or  make  their  reactions  with  this  material  too 
slow.  Also,  the  instabihty  of  proteins  and  particularly  the  risk  of  denatura- 
tion  at  surfaces  is  an  obstacle.  Most  of  the  common  types  of  chromatography 
as  applied  to  aqueous  solutions  have  been  used  for  protein  separations,  i.e. 
ionic  exchange,  partition  and  adsorption.  In  each  case,  however,  one  has 
found  that  only  certain  systems  have  the  properties  required.  Both  cationic 
and  anionic  exchange  resins  have  been  used,  as  well  as  ionic  exchangers  pre- 
pared from  cellulose.  Partition  systems  have  been  worked  out,  consisting 
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of  glycols,  water  and  salts,  in  which  insulin  and  recently  also  y-globulins 
have  been  fractionated.  Among  adsorbents  calcium  phosphate  seems  to 
offer  special  advantages,  and  I  will  show  some  pictures  illustrating  this 
method  (Fig.  4;  for  details  see  figure  text). 

This  experiment  illustrates  a  behaviour  which  is  often  observed  in  protein 
chromatography,  and  which  is  markedly  different  from  what  is  usually  ex- 
perienced in  elution  chromatography  of  low-molecular  weight  substances. 
It  is  very  difficult  or  impossible  to  find  a  buffer  solution  which  will  elute 
both  the  phycoerythrin  and  the  phycocyanin  as  reasonably  well-defined 
zones,  migrating  with  different  7?/-values.  It  appears  that  the  Rf  of  many 
proteins  is  highly  dependent  on  the  concentration  of  the  eluting  buffer,  so 
that  its  value  will  shift  from  almost  zero  to  almost  1  -0  in  a  comparatively 
narrow  interval  of  buffer  concentration.  This  is  to  be  expected,  according 
to  the  law  of  mass  action,  if  the  attachment  of  one  protein  molecule  to  the 
column  will  lead  to  a  liberation  of  several  small  ions.  The  consequence  is, 
however,  that  elution  in  the  ordinary  way  is  not  suitable,  but  rather  a  step- 
wise or  gradient  elution  must  be  apphed.  The  step-wise  elution  is  often  very 
efficient,  but  involves  the  risk  that  a  slowly  eluted  component  may  appear 
in  more  than  one  step.  In  the  interpretation  of  such  experiments  it  is  essen- 
tial to  rechromatograph  the  fractions  obtained  to  ascertain  if  their  different 
chromatographic  properties  can  be  reproduced  in  separate  experiments.  I 
want  to  stress  this  particularly,  as  the  results  so  far  often  indicate  a  com- 
plexity of  proteins  which  were  earlier  believed  to  be  reasonably  homogeneous. 
The  pronounced  dependence  of  the  partition  coefficient  of  proteins  on  the 
composition  of  the  two-phase  system  is,  of  course,  an  analogous  phenomenon. 

There  is  no  doubt  that  chromatography  of  proteins  already  has  shown  its 
usefulness  both  in  preparative  and  analytical  work,  and  that  the  method 
appears  to  possess  a  high  degree  of  specificity.  I  shall  quote  only  a  few 
examples  : 

Synthetic  polycarboxylic  acid  resins:  cytochrome-c  ribonuclease,  lyso- 
zyme,  chymotrypsinogen,  chymotrypsin,  hemoglobins,  histones. 
Cellulose  ionic  exchangers:  serum  proteins. 
Synthetic  anion  exchange  resins  :  serum  proteins,  phosphatases. 
Partition  chromatography:  ribonuclease,  insulin,  y-globulins. 
Adsorption  chromatography:  various  proteins  and  enzymes. 

References  are  found  in  the  above-mentioned  review  by  Moore  and 
Stein  (1956). 

With  the  'all  or  none'  elution  behaviour  of  many  proteins  discussed  above 
one  may  of  course  ask,  whether  column  operation  has  any  particular  ad- 
vantage or  if  one  could  not  just  as  well  apply  a  batch  operation.  This  is  no 
doubt  practical  in  many  cases.  However,  column  operation  has  certain  prac- 
tical advantages,  particularly  with  systems  of  unknown  behaviour.  It  should 
be  noted  also  that  even  if  the  length  of  the  column  does  not  contribute 
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essentially  to  the  separation  and  if  each  component  can  be  eluted  at  a  bufTer 
concentration  giving  /?/=!,  mutual  displacement  effects  may  contribute  very 
much  to  the  separation  of  closely  similar  substances,  and  in  this  case  a  com- 
paratively long  column  is  of  value.  Such  displacement  effects  are  very 
common  and  often  very  pronounced  among  proteins.  Zones  may  often  be 
found  which  during  migration  through  the  column  show  marked  separa- 
tion without  splitting  up  into  two  distinct  peaks,  the  slower  material  being 
carried  along  with  the  faster  by  the  reduction  in  affinity  to  the  column  caused 
by  the  displacement  effect.  It  appears  likely  that  a  closer  study  of  such  dis- 
placement effects  may  lead  to  a  better  understanding  of  the  phenomena 
underlying  protein  chromatography  and  perhaps  also  to  more  efficient 
and  rational  modes  of  operation. 

In  the  discussion  to  follow,  Dr  H.  Boman  will  show  some  results  which 
illustrate  the  importance  of  displacement  effects  in  the  chromatography  of 
some  proteins. 
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Mutual  displacement  in  protein  chromatography 

HANS  G.  BOMAN 

Institute  of  Biochemistry,  University  of  Uppsala 


Similar  substances  have  sometimes  a  marked  tendency  to  compete  for  the 
same  sites  on  an  adsorbent.  This  mechanism  was  in  1943  used  for  chromato- 
graphic separations  by  Tiselius^  who  showed  that  when  a  mixture  of  sucrose 
and  maltose  were  eluted  with  0-5%  phenol,  the  sugars  came  out  from  the 
column  as  narrow  zones,  bordering  each  other.  This  type  of  chromatography 
was  named  displacement  development. 

Ten  years  later.  Polis  and  Shmukler^  were  the  first  to  describe  and  utilize 
protein-protein  displacement  in  their  purification  of  coloured  milk  proteins 
with  calcium  phosphate  columns.  Unfortunately,  they  did  not  give  any  figures 
illustrating  their  displacement  experiments  and  nobody  else  seems  to  have 
made  use  of  their  interesting  findings. 

Less  than  two  years  ago,  when  working  out  the  technique  for  protein 
chromatography  on  the  anion  exchange  resin  Dowex  2,^  we  found  that 
when  a  mixture  of  horse-radish  enzymes  was  eluted  with  a  concentrated 
buffer,  peroxidase  and  acid  phosphatase  were  separated  into  two  narrow 
bands  as  shown  in  Fig.  1,  upper  part.  The  same  separation  has  later  been 
obtained  also  on  a  cellulose  anion  exchanger.  Fig.  1,  lower  part.* 

In  ideal  displacement,  the  length  of  the  zone  should  be  proportional  to 
the  amount  of  material,  and  the  concentration  should  depend  on  the  affinity 
constant  of  the  substance  and  the  concentration  of  the  displacing  agent.  ^ 
When  the  loading  of  the  columns  were  varied,  we  obtained  variations  in 
peak  width  indicating  that  protein-protein  displacement  was  a  major  factor 
contributing  to  the  separation  of  peroxidase  and  the  first  phosphatase.  This 
was  different  with  the  second  phosphatase  which  obviously  was  eluted  by 
the  chloride  concentration. 

However,  formally,  these  experiments  (Fig.  1)  could  be  characterized  as 
elution  with  a  very  steep  gradient  of  chloride  and,  in  fact,  the  referee  of  our 
second  paper*  did  never  believe  that  any  protein-protein  displacement  was 
demonstrated  by  our  experiments. 

It  was  therefore  thought  to  be  of  interest  to  show  that  a  moderately  ad- 
sorbed enzyme,  prostatic  acid  phosphatase,  could  be  desorbed  by  a  more 
strongly  adsorbed  protein.  Dowex  2  was  chosen  as  adsorbent  only  because 
of  the  fact  that  we  had  elution  data  for  a  variety  of  proteins  just  on  this 
resin. 3  The  dimensions  of  the  column  were  19x1-2  cm.  The  buffer  con- 
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Fig.  1.  (a)  Upper  part:  One-step  elution  of  100  mg.  horse-radish  peroxidase  and  acid 
phosphatase  on  a  180  ml.  column  of  Dowex  2.  Fraction  volumes  about  0-65  ml.  For 
details  see  ref.  3.  (6)  Lower  part:  One-step  elution  of  35  mg.  horse-radish  peroxidase  and 
acid  phosphatase  on  a  17  ml.  column  of  TEAE  cellulose.  Fraction  volumes  0-5-0-6  ml. 
For  details  see  ref.  4. 


centration  has  been  kept  constant  through  each  experiment  and  the  de- 
veloper used  was  a  aa-globuHn  fraction  from  human  serum  (kindly  supplied 
by  AB  KABI,  Stockholm).  The  results  from  three  experiments  carried  out 
at  different  buffer  concentrations  are  given  in  Fig.  2.  At  the  start,  the  column 
was  in  equilibrium  with  a  THAM  (2-amino-2-hydroxymethylpropane-l  :  3 
diol)-HCl  buffer  of  pH  7-3.  In  each  experiment,  the  same  amount  of  freeze- 
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dried  prostatic  extract  (1  mg.)  was  dissolved  in  the  corresponding  buffer 
and  applied  to  the  column.  Elution  was  then  directly  started  with  a  0-1% 
solution  of  aa-globulin,  also  dissolved  in  the  corresponding  buffer.  Phos- 
phatase activity  was  determined  and  expressed  as  described  earlier.^ 
The  upper  part  of  Fig.  2  shows  that  with  0-02m  buffer,  only  small  amounts 
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Fig.  2.  Displacement  analysis  of  prostatic  acid  phosphatase  at  three  different  concentra- 
tions of  THAM-HCI  buffers  of  pH  7-3.  In  each  experiment,  1  mg.  of  prostatic  extract 
was  applied  to  a  column  with  20  ml.  of  Dowex  2  and  developed  with  a  0*1%  solution  of 
og-globulin  from  human  serum.  Fraction  volumes  around  0-8  ml. 

of  phosphatase  (recovery  4%)  are  eluted  as  a  very  broad  zone  and  that  the 
front  of  the  activity  coincides  with  that  of  the  developer.  As  a  whole,  this 
experiment  looks  like  an  unsuccessful  frontal  analysis.  However,  when  the 
adsorption  of  the  phosphatase  is  decreased  by  increasing  the  buffer  con- 
centration to  0-06m,  22%  of  the  activity  appears  as  a  localized  zone,  just  at 
the  beginning  of  the  front  of  the  developing  aa-globulin  (Fig.  2,  middle 
part).  With  a  further  decrease  in  the  adsorption  of  the  enzyme,  when  using 
a  0-IOm  THAM-HCI  buffer,  62%  of  the  activity  is  recovered  in  a  sharp 
zone  which  has  moved  forward  with  regard  to  the  front  of  the  developer 
(Fig.  2,  lower  part).  The  small  peak  around  tube  10  is  non-adsorbed  material. 
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These  experiments  demonstrate  that  protein-protein  displacement  may  be 
a  practically  useful  phenomenon  and  that  the  optimal  buffer  concentration 
is  an  important  factor.  Sober  et  al.^  describe  how  serum,  during  adsorption 
on  cellulose  anion  exchanger,  separates  into  a  number  of  'distinct  coloured 
bands',  which,  however,  are  spread  out  during  the  gradient  elution.  Similar 
observations  have  been  made  in  Uppsala,  especially  on  calcium  phosphate 
(Hjertén,  to  be  published).  The  most  likely  explanation  is  that  a  good  separa- 
tion is  obtained  by  mutual  protein  displacement  during  the  adsorption  step, 
which  then  is  spoiled  by  the  salt  gradient  which  passes  over  the  optimal 
condition.  It  may  therefore  be  possible,  that  in  the  case  of  marked  displace- 
ment effects,  a  better  resolution  of  a  chromatogram  is  obtained  with  a  pro- 
tein solution  as  developer  [cf.  (2)],  than  that  resulting  from  changes  in  buffer 
composition. 

It  has  been  suggested  that  it  is  easier  to  find  the  optimal  conditions  for 
the  separation  of  a  protein  mixture  with  zone  electrophoresis  than  with 
chromatography.  Personally  I  do  not  want  to  make  any  general  statement 
on  this  question,  but  I  feel  that  the  one-step  elution  on  an  anion  exchanger 
is  a  rapid  and  convenient  way  to  obtain  valuable  information  about  a  com- 
plicated enzyme  mixture.^-^-^ 
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During  the  course  of  the  fractionation  studies  required  to  isolate  a  new 
chemical  individual  from  a  naturally  occurring  or  synthetic  mixture  much 
is  usually  learned  about  the  nature  of  the  substance.  Thus  distillation  or 
the  ultracentrifuge  will  set  certain  limits  concerning  its  size,  electrophoresis 
gives  information  about  its  net  charge  at  specific  pH  levels,  countercurrent 
distribution  can  tell  something  of  its  polarity,  etc.  In  considering  the  various 
fractionation  techniques  now  available  for  isolating  relatively  large  mole- 
cules it  occurred  to  us  that  a  technique  which  would  separate  substances  on 
the  basis  of  their  size  or  shape  would  not  only  be  a  very  useful  technique 
supplementing  those  now  used,  but  could  also  give  very  informative  data 
about  size  and  shape.  With  this  idea  in  mind  we  have  turned  to  the  study 
of  membrane  diffusion. 

The  viewpoint  taken  in  this  work  differs  from  that  of  the  numerous  studies 
on  membranes  already  in  the  literature  in  a  number  of  ways.  Perhaps  most 
important — much  less  emphasis  has  been  placed  on  pore  size.  Membranes 
are  compared  on  the  basis  of  the  rate  they  permit  standard  solutes  of  known 
size  to  pass  by  diffusion  as  well  as  on  the  basis  of  total  exclusion.  An  attempt 
is  made  to  minimize  charge  efifects.  The  separate  units  are  designed  with 
fractionation  in  mind  and  with  the  intention  of  making  them  part  of  a 
countercurrent  process. 

Dialysis  is  one  of  the  earlier  separation  methods  to  have  been  used,  and 
is  familiar  to  everyone  for  separating  dialy sable  from  nondialy sable  solutes. 
However,  few  studies  attempting  to  refine  the  technique  beyond  this  point 
are  to  be  found  in  the  literature.  Probably  the  most  promising  were  those 
of  Signer^  and  co-workers,  who  experimented  with  a  continuous  counter- 
current  dialysis  train  of  unique  design. 

*  National  Science  Foundation  Fellow. 

j-  Fellow  of  the  National  Foundation  for  Infantile  Paralysis. 
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As  regards  the  measurement  of  size,  the  approach  we  have  followed  is 
similar  to  that  in  the  well-known  method  of  molecular  weight  determina- 
tion of  Northrop  and  Anson, ^  who  measured  the  rate  of  diffusion  of  a 
solute  through  sintered  glass.  However,  as  will  be  seen  later,  there  are 
marked  differences. 

It  was  realized  in  the  beginning  that  the  rate  of  diffusion  through  a  mem- 
brane would  be  a  function  of  a  number  of  interdependent  variables,  when 
the  sole  driving  force  is  a  concentration  gradient  across  the  membrane. 
These  are  the  nature  of  the  membrane,  charge  effects  on  the  membrane, 
effect  of  solvent,  pH,  ideaUty  of  the  solute,  temperature,  etc. 

In  prehminary  studies  cellophane  appeared  to  be  the  most  promising 
membrane  for  the  investigation.  A  simple  cell  was  constructed  from  a  cello- 
phane sac  in  which  both  the  contents  of  the  sac  and  the  solvent  around  it 
were  stirred.  By  measuring  the  rates  of  escape  of  solutes  of  known  size 
through  the  sac,  it  was  soon  found^  that  the  escape  rates  did  not  bear  the 
relationship  to  size  that  would  be  expected  from  free  diffusion  alone  and 
as  would  be  expected  from  the  Northrop  and  Anson  cell.  A  greater  selec- 
tivity was  noted  which  suggested  a  membrane  effect  in  addition  to  the  dif- 
ferential provided  by  diffusion. 


Fig.  1.  Schematic  drawing  of  the  microdialysis  cell.  A  7-mm.  glass 
tube  fitted  at  the  top  with  a  2-ml.  syringe  passes  through  two  larger 
sections  of  glass  tubing  A  and  B.  The  dialysis  membrane  represented 
by  the  dashed  line  is  made  by  tying  a  knot  in  a  wet  piece  of  cellophane 
tubing  with  care  to  avoid  any  stretching  of  the  membrane.  A  is  of  such 
an  outside  diameter  (ca.  15  mm.)  that  it  is  a  little  less  than  the  diameter 
of  the  extended  sac.  B  is  slightly  larger  in  diameter  than  A  and  serves  to 
hold  the  membrane  in  position.  C  is  a  test  tube  which  is  large  enough  to 
provide  about  2  mm.  clearance  from  the  membrane.  It  holds  the  outside 
solvent  amounting  to  about  10-20  times  the  volume  of  the  solution 
inside  the  sac.  The  upper  rim  of  C  is  collapsed  somewhat  in  order  to 
keep  it  centered.  The  lower  end  of  C  rests  on  the  table  and  can  be  easily 
centered.  During  operation  the  height  of  the  solutions  both  inside  the  sac 
and  outside  are  adjusted  to  reach  only  the  top  of  A.  A  small  hole  is 
punched  through  the  membrane  just  below  B  in  order  to  avoid  pressure 
being  built  up  during  the  run. 


w 


In  order  to  enhance  this  effect  it  appeared  desirable  to  try  less  porous 
membranes  so  that  the  solutes  would  be  even  more  restricted  in  their  pass- 
age, or  alternatively,  use  larger  test  solutes  with  the  same  membrane.  Either 
course  implied  a  much  slower  process,  although  the  time  involved  had 
already  proved  a  serious  handicap.  Therefore,  an  attempt  was  made  to  de- 
sign a  cell  which  would  give  a  faster  rate  of  net  transfer  across  the  membrane. 

It  is  known*  that  the  net  rate  of  transfer  under  a  concentration  gradient 
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of  solute  through  a  membrane  is  directly  proportional  to  the  cross  sectional 
area  of  the  membrane.  It  is  also  proportional  to  the  concentration  gradient 
and,  therefore,  for  a  given  weight  of  solute,  inversely  proportional  to  the 
volume  inside  the  sac.  The  most  rapid  net  rate  of  transfer  of  solute,  there- 
fore, would  be  given  when  the  solution  to  be  dialysed  is  spread  evenly  as 
a  thin  film  over  the  entire  inside  surface  of  the  dialysis  sac.  The  concentra- 
tion outside  should  be  kept  as  low  as  possible.  This  can  be  accomplished 
very  easily  by  the  simple  apparatus^  shown  schematically  in  Fig.  1. 

In  this  cell  all  the  solution  inside  is  held  in  a  thin  film  of  the  order  of 
0- 1-0-2  mm.  in  thickness.  No  stirring  is  required.  The  rate  of  escape  of  a 
solute  is  measured  by  periodic  removal  of  the  solution  outside  for  analysis 
and  replacement  with  fresh  solvent.  At  these  times  the  film  inside  can  be 
redistributed  by  moving  the  syringe  plunger  at  the  top  of  the  cell.  Actual 
experiment  showed  that  with  this  arrangement  the  rate  of  dialysis  for  a 
given  membrane  was  of  the  order  of  10-100  fold  that  afforded  by  the  usual 
dialysis  procedures. 

If  the  experimental  arrangement  is  satisfactory  and  the  solute  behaves 
ideally  its  rate  of  escape  should  be  proportional  only  to  the  concentration 
gradient.  The  rate  should  thus  follow  first  order  reaction  kinetics,  and  when 
the  logarithm  of  the  decrease  in  concentration  is  plotted  against  time,  a 
straight  line  should  be  obtained. 
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Fig.  2.  Escape  rate  curves  of  bacitracin  and  subtilin. 


This  has  now  been  accomplished  with  many  different  solutes.  Fig.  2  is 
an  example  with  two  different  peptides — bacitracin  (M.W.  1,422)  and  sub- 
tilin (M.W.  3,300)  using  18/32  Visking  cellophane.  If  a  mixture  of  equal 
parts  of  the  two  should  be  taken  the  dialysis  could  be  interrupted  after 
50  minutes  when  80%  of  the  bacitracin  would  be  out,  but  only  20%  of 
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the  subtilin.  A  series  of  such  cells  might  be  employed  in  countercurrent 
manner  as  in  countercurrent  distribution.  At  each  stage  the  dialysate  could 
be  evaporated  and  put  inside  the  sac  of  the  next  member  of  the  series.  Thus 
from  each  cell  80%  of  the  bacitracin  would  be  moved  forward  at  each  stage, 
but  only  20%  of  the  subtilin.  The  familiar  mathematics  of  countercurrent 
distribution  (the  binomial  theorem)  is  directly  applicable  and  will  tell  us 
that  the  bacitracin-subtilin  mixture  is  an  easy  separation  requiring  only  a 
few  stages.  Useful  separations  with  peptides  and  derivatives  have  been 
accomplished  this  way.^-® 

The  straightness  of  the  line  with  a  supposedly  pure  substance  affords  a 
measure  of  adherence  to  ideality  in  the  solvent  used  and  of  the  homogeneity 
with  respect  to  size  and  shape.  In  the  countercurrent  process  a  straight  line 
represents  ideahty  and  is  desirable  just  as  a  linear  partition  isotherm  repre- 
sents ideality  in  countercurrent  distribution.  An  unfavorable  solvent  can 
produce  molecules  of  different  sizes  from  an  otherwise  pure  solute  by  pro- 
moting association  or  theoretically  by  causing  a  fraction  of  the  molecules 
to  assume  a  shape  different  from  the  others  but  perhaps  not  instantaneously 
interconvertible. 
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Fig.  3.  Escape  rate  curves  obtained  with  a  sample  of  protamine. 


Where  a  mixture  of  sizes  is  present  the  deviation  from  a  straight  line  may 
take  the  form  of  distinct  breaks  in  the  curve  as  shown  in  Fig.  3  which  was 
obtained  with  a  20  mg.  sample  of  a  crude  protamine.  Curve  1  was  obtained 
by  weight  analysis  of  the  dialysate  fractions,  but  Curve  2  was  obtained  by 
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the  Moore-Stein  ninhydrin  procedure.'  The  figures  at  each  point  on  the 
weight  curve  give  the  ninhydrin  values  per  mg.  and  definitely  indicate  a 
range  of  sizes. 

The  first  35%  of  the  solute  to  pass  was  recovered  and  put  in  a  second 
cell.  Now  weight  analysis  gave  Curve  3  and  ninhydrin  gave  Curve  4.  Here 
a  mixture  of  sizes  is  definitely  being  fractionated. 

A  rough  survey  of  the  solvent  question  and  other  variables  intimately 
involved  with  it  such  as  pH,  salt  content,  temperature,  etc.,  seemed  to  show 
that  with  the  polypeptides  and  smaller  proteins  0-01n  acetic  acid  at  25° 
seemed  to  offer  an  environment  most  favorable  to  ideality.  A  survey  was 
then  made  of  a  number  of  solutes  of  increasing  molecular  weight  in  a  cell 
with  Visking  20/32  cellophane.  These  data^  are  given  in  Table  1,  where  the 
escape  rate  is  expressed  as  a  50%  escape  time.  For  comparison  a  few  data 
are  given  in  0-1n  acetic  acid  and  in  0-01n  acetic  acid  in  18/32  Visking,  a  less 
porous  casing.  The  molecular  weights  listed  are  those  from  the  literature. 

Table  1  does  not  list  all  the  solutes  studied.  A  number  were  not  included 
because  they  showed  too  great  a  deviation  from  ideality.  There  is  not  time 


Table  1 
50%  ESCAPE  TIMES  OF  VARIOUS  POLYPEPTIDES  AND  PROTEINS 


50%  Escape  Time 

Solute 

M,  W. 

01  N  HAc 

0  01  N  HAc 

001  N  HAc 

20/32 

20/32 

18/32 

Tryptophan 

204 

— 

4    Min. 

6  Min. 

Bacitracin 

1,422 

— 

15     Min. 

21  Min. 

Salmiridin* 

2,000-8,000 

— 

40    Min. 

150  Min. 

Subtilin 

3,300 

— 

54     Min. 

138  Min. 

B  Chain  from  Insulin 

3,600 

— 

42     Min. 

Will  not  pass 

Glucagon 

4,000 

— 

50     Min. 

Insulin 

5,733 

l-5Hrs. 

— 

Cytochrome  C 

12,000 

— 

60     Min. 

Ribonuclease 

13,600 

3-5  Hrs. 

2    Hrs. 

Lysozyme 

14,000 

3-5  Hrs. 

2-3  Hrs. 

Trypsin 

20,000 

— 

4     Hrs. 

Trypsinogen 

20,000 

— 

6-5  Hrs. 

Chymotrypsin 

24,500 

— 

5     Hrs. 

Chymotrypsinogen 

25,000 

12  Hrs. 

9     Hrs. 

Pit.  Lact.  Hormone 

26,000 

— 

13     Hrs. 

Gliadin 

27,000 

— 

29     Hrs. 

Ovomucoid 

28,000 

— 

35     Hrs. 

Pepsin 

35,500 

— 

80     Hrs. 

Ovalbumin 

45,000 

More  slowly 
than  Pepsin 

^ 

f 

*  A  protamine  from  the  rainbow  trout  obtained  through  the  courtesy  of  the  Nordisk 
Insulinlaboratorium,  Denmark. 
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here  to  discuss  the  individual  behavior  of  each  solute,  but  a  few  generaliza- 
tions seem  to  be  appropriate.  Firstly,  the  increase  in  molecular  weight  (M.W.) 
consistently  follows  the  same  sequence  as  the  decrease  in  escape  rate,  but 
not  in  direct  proportion.  A  relative  increase  in  M.W.  has  a  greater  effect 
as  the  maximum  range  of  size  capable  of  passing  is  approached.  Thus  the 
rate  ratio  for  bacitracin/subtilin  in  20/32  casing  is  a  little  more  than  three- 
fold, but  in  18/32  casing  it  is  more  than  six-fold.  The  selectivity,  therefore, 
becomes  greater  as  the  limiting  size  is  approached.  Since  the  list  of  proteins 
in  Table  1  includes  acidic,  neutral,  and  strongly  basic  ones  with  no  obvious 
effect  on  the  relationship  of  escape  rate  to  M.W.,  it  would  appear  that 
charge  effects  are  relatively  unimportant  in  the  determination  of  escape  rate. 

The  results  with  two  pairs  of  proteins  are  particularly  striking.  These  are 
trypsin-trypsinogen  and  chymotrypsin-chymotrypsinogen.  Although  each 
enzyme  is  so  closely  related  in  size  to  its  zymogen  that  the  accepted  methods 
of  molecular  weight  do  not  differentiate  them,  a  striking  difference  in  escape 
rate  is  noted  here.  It  would  seem  that  this  must  reflect  a  difference  in  shape. 
Indeed,  such  a  theory  is  in  line  with  the  published  results  of  Neurath  and 
co-workers.^ '^ 

If  the  escape  rate  of  proteins  in  this  type  of  cell  is  really  governed  almost 
entirely  by  its  overall  outside  dimensions,  then  the  shape  of  the  molecule 
should  also  have  a  strong  influence.  The  proteins  in  Table  1  are  all  known 
to  be  quite  globular  in  shape  except  under  conditions  which  denature  them. 
We  have  accordingly  investigated  the  effect  of  denaturing  conditions  on 
dialysis  rate.  These  include  the  effect  of  temperature,  pH,  urea,  trichloro- 
acetic acid,  oxidation  with  performic  acid,  etc.  In  all  cases  where  these 
conditions  are  known  to  cause  denaturation,  reversible  or  otherwise,  a 
strikingly  slower  dialysis  rate  has  been  observed. 

Table  2 
EFFECT    OF    TEMPERATURE    (IN    DISTILLED    WATER) 


Substance 

50%  Escape  Time  at 

4° 

25° 

41° 

65° 

Bacitracin  A 
Ribonuclease 
Lysozyme 
Chymotrypsin 

50  Min. 
7  0  Hrs. 

15  Min. 
2  0  Hrs. 
2-2  Hrs. 
8  0  Hrs. 

6  Min. 

I  -2  Hrs. 

II  Hrs. 

6  Min. 
2-5  Hrs. 
48  Min. 
>60  Hrs. 

The  effect  of  temperature  is  shown  in  Table  2.  Ribonuclease  is  known^° 
to  undergo  reversible  unfolding  at  a  temperature  of  65°.  This  is  reflected 
here  by  a  slower  escape  rate  at  65°  than  at  41°.  Lysozyme,  on  the  other  hand, 
is  known  to  be  very  stable  and  shows  a  faster  escape  rate  at  65°  than  at  41° 
One  might  infer  from  these  data  that  a  mixture  of  the  two  proteins  could 
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be  separated  by  dialysis  at  65°  even  though  their  molecular  weights  are  very 
similar. 

Table  3 
EFFECT    OF    pH    ON    50%    ESCAPE    TIME 


HCl 

HCl 

Acetic 

Acetic 

Distilled 

a-Picoline 

Ammonia 

Substance 

003  N 

0  003  N 

0-1  N 

001  N 

water 

001  N 

001  N 

pH  1-5 

pH2-6 

pH2-9 

pH3-4 

pH5-6 

pH7-2 

pH9-2 

Bacitracin  A 

48  Min. 

35  Min. 

15  Min. 

15  Min. 

15  Min. 

77  Min. 

Insulin 

— 

2-4  Hrs. 

1  -5  Hrs. 

— 

— 

— 

— 

Ribonuclease 

— 

3-5  Hrs. 

3-2  Hrs. 

2-2  Hrs. 

20  Hrs. 

2-8  Hrs. 

Very  slow 

Oxidized 

Ribonuclease 

— 

— 

— 

5-2  Hrs. 

3-8  Hrs. 

Very  slow 

— 

Lysozyme 

7-2  Hrs. 

2-8  Hrs. 

3-2  Hrs. 

2-3  Hrs. 

2-3  Hrs. 

1  -8  Hrs. 

4-7  Hrs. 

Trypsin 

— 

— 

— 

4  0  Hrs. 

3-6  Hrs. 

3-6  Hrs. 

— 

Chymotryp- 

sin 

— 

— 

— 

5  0  Hrs. 

8  0  Hrs. 

3-8  Hrs. 

— 

Chymotryp- 

sinogen 

~ 

9-0  Hrs. 

4-9  Hrs. 

16-3  Hrs. 

Table  3  shows  the  effect  of  pH  at  25°.  The  results  with  bacitracin  were 
obtained  with  18/32  Visking,  but  20/32  Visking  was  used  for  all  the  others. 
Bacitracin  is  known  to  be  unstable  below  a  pH  of  2  and  above  a  pH  of  7. 
Ribonuclease  shows  the  behavior  to  be  expected  and  is  quite  different  from 
oxidized  ribonuclease.  Lysozyme  shows  a  markedly  slowed  rate  of  dialysis 
only  at  the  pH  extremes.  Autodigestion  probably  plays  a  role  with  trypsin 
and  chymotrypsin  at  the  higher  pH. 


Table  4 
DIALYSIS    IN    UREA    SOLUTION 


50  %  Escape  Time  in 

Substance 

H2O 

1  M 

3M 

6M 

9M 

Bacitracin  A 

15  Min. 

19  Min. 

19  Min. 

20  Min. 

33  Min. 

Insulin* 

1  -5  Hrs. 

2-5  Hrs. 

2-5  Hrs. 

5  0  Hrs. 

5  0  Hrs. 

Ribonuclease 

2  0  Hrs. 

2  0  Hrs. 

2-7  Hrs. 

3-7  Hrs. 

8  0  Hrs. 

Ox.  Ribonuclease 

3-8  Hrs. 

3-8  Hrs. 

6-8  Hrs. 

— 

8-6  Hrs. 

Lysozyme 

2-3  Hrs. 

1  -9  Hrs. 

21  Hrs. 

2-8  Hrs. 

3-4  Hrs. 

Cyt.  C. 

— 

1  -5  Hrs. 

1-3  Hrs. 

1  -9  Hrs. 

4-2  Hrs. 

Trypsin 

3-6  Hrs. 

3-5  Hrs. 

3-5  Hrs. 

7-6  Hrs. 

200  Hrs. 

Chymotrypsin 

8  0  Hrs. 

6-2  Hrs. 

6-2  Hrs. 

6-2  Hrs. 

45  0  Hrs. 

Chymotrypsinogen 

4-9  Hrs. 

5-5  Hrs. 

6-4  Hrs. 

10-5  Hrs. 

68-0  Hrs. 

*  Done  in  OlN  acetic  acid. 
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The  effect  of  urea  at  25°  is  shown  in  Table  4.  Again  there  is  good  correla- 
tion with  the  known  effect  of  various  concentrations  of  urea  on  these  pro- 
teins. The  tendency  to  unfold  could  be  followed  by  viscosity  measurements 
or  by  measuring  specific  optical  rotations.  Thus  far,  only  the  latter  has  been 
studied.  A  solution  giving  an  increase  in  optical  activity  has  also  shown 
a  decrease  in  escape  rate,  both  in  the  urea  studies  and  in  the  pH  studies. 

Proteins  are  known  to  associate  in  a  favorable  environment  to  give  dimers, 
trimers,  etc.  Indeed,  with  certain  representatives  this  tendency  has  been  so 
strong  that  the  smallest  dissociable  unit  has  been  recognized  only  after  pro- 
longed study.  Insulin  has  been  an  example  of  this  type  of  substance,^^  but 
was  finally  recognized  as  having  a  dissociable  unit  as  small  as  6,000.  This 
value  fits  very  well  into  the  series  in  Table  1.  The  escape  curve  in  0-lN 
acetic  acid  and  20/32  Visking  for  material  purified  by  countercurrent  dis- 
tribution (CCD)  is  shown  in  Fig.  4.  The  escape  curve  for  a  sample  of  amor- 
phous commercial  insulin  is  also  shown.  Perhaps,  in  this  latter  a  certain 


•  Insulin  A  peak 
o  Amorphous  insulin 


\t        16 
Hours 


20 


24 


28 


Fig.  4.  Escape  rate  curves  with  insulin  preparations. 


percentage  of  the  molecules  have  already  begun  to  associate  preliminary  to 
the  formation  of  insoluble  fibrils.  ^^  Irrespective  of  the  truth  of  this  possi- 
bility, when  insulin  is  brought  into  solution  at  a  concentration  of  1  %  with 
a  minimal  amount  of  ammonia  the  pH  is  about  8-0,  and  when  this  solution 
is  placed  inside  the  sac  of  the  cell  with  water  outside,  the  protein  will  not 
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diffuse  through  20/32  Visking.  Insulin  is  well  known  to  aggregate  at  the 
higher  pH  values. 

The  A  chain  of  insulin  prepared  by  performic  acid  oxidation  and  purified 
by  CCD  did  not  give  a  straight  line  in  the  dialysis  study  at  any  pH,  and 
scarcely  passes  the  membrane  above  pH  7.  It  is  apparently  completely  in 
the  extended  form  with  a  high  optical  activity.  On  the  other  hand,  the  B 
chain  gave  a  fairly  ideal  curve  much  like  the  purified  insulin  of  Fig.  4  except 
that  it  diffused  through  the  membrane  at  a  faster  rate. 

Very  early  in  the  study  with  proteins,  it  was  discovered  that  salt  in  the 
solution  had  a  retarding  effect  on  the  rate  of  dialysis.  As  an  example,  escape 
curves  at  25°  and  20/32  Visking  with  ribonuclease  in  solutions  of  different 
sodium  chloride  concentrations  are  shown  in  Fig.  5.  In  each  case  a  break 
in  the  curve  was  noted,  but  at  a  different  point  on  the  curve  for  each  salt 
concentration.  Studies  at  40°  showed  a  faster  overall  escape  rate  but  similar 
breaks  in  the  curve  and  were  otherwise  comparable. 


Hours 

Fig.  5.  Escape  rate  curves  of  ribonuclease  in  aqueous  solutions  with  various  amounts 
of  sodium  chloride. 


If  the  50%  time  is  plotted  against  ionic  strength  the  curve  shown  in  Fig.  6 
is  obtained.  When  magnesium  sulfate  was  used  one-fourth  the  molar  con- 
centration was  found  to  have  the  same  quantitative  effect  as  sodium  chloride. 
Thus  the  ionic  strength  of  the  solution  seems  to  be  important  rather  than 
the  specific  ion  involved. 

An  attempt  was  made  to  isolate  the  faster  diffusing  protein  before  the 
break  for  a  run  with  no  salt  and  also  the  part  emerging  after  the  break. 
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These  two  fractions  showed  no  significant  difference  in  enzymatic  activity, 
optical  activity,  or  in  their  chromatographic  behavior  on  an  ion  exchange 
column.^'*  When  both  fractions  were  separately  re-run  in  the  dialysis  pro- 
cedure there  was  a  noticeable  difference  in  the  slope  and  position  of  the 
slower  component,  but  quantitatively  much  less  than  would  have  been 
expected  from  the  initial  run. 


20  r 


Effect  of  salt  in  0.01M  acetic  acid 


.00001 


.0001       .001        .01         .1 
Ionic   atpength 


Fig.  6.  Effect  of  ionic  strength  on  50  %  escape  time. 


That  the  effect  noted  was  due  to  the  solute  rather  than  to  some  property 
of  the  membrane  was  shown  by  an  experiment  with  a  larger  amount  of 
the  protein  initially.  At  a  time  just  prior  to  the  break  in  the  curve  the 
inside  solution  was  transferred  directly  to  a  new,  freshly  prepared  membrane 
and  cell.  The  effect  from  this  point  on  was  much  more  pronounced  with  a 
retarded  escape  rate  and  with  the  break  much  nearer  the  origin.  This  de- 
monstrated that  the  effect  was  not  due  to  a  tendency  of  the  membrane  to 
become  'plugged'.  The  most  likely  explanation  of  the  break  in  the  curve 
would  appear  to  be  based  on  the  possibility  of  more  than  one  configuration, 
state  of  aggregation,  degree  of  folding  or  shape  in  a  single  protein  and  with 
a  sufficiently  slow  rate  of  interconversion  so  that  two  escape  rates  are  regi- 
stered in  the  curves. 

Thus  far,  different  membranes  from  the  same  roll  have  shown  remarkable 
uniformity.  Even  from  roll  to  roll  in  the  same  size  there  has  been  good 
uniformity.  However,  it  would  appear  advisable  to  standardize  each  roll 
since  two  20/32  rolls  obtained  recently  have  shown  a  slower  escape  rate 
than  the  earlier  ones  used  to  accumulate  most  of  the  data  presented  in  this 
paper. 

The  escape  rate  can  be  reproducibly  increased  by  stretching  a  wet  mem- 
brane and  decreased  by  acetylating  it.  A  slower  membrane  when  acetylated 
gives  a  slower  escape  rate  for  a  solute  (e.g.,  with  18/32  bacitracin  barely 
passes)  than  a  faster  one  (e.g.,  20/32  gives  a  50%  escape  time  of  3  hours)« 

Hps 
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Both  acetylated  membranes  readily  pass  amino  acids.  The  further  possi- 
bihties  here  do  not  require  emphasis. 

It  would  appear  that  the  considerable  data  we  have  now  accumulated  all 
support  the  view  that  membrane  diffusion  with  cellophane  can  be  studied 
in  such  a  way  that  the  size  and  (or)  shape  of  the  molecule  is  the  predominant 
property  being  measured.  Perhaps,  in  conclusion,  a  speculation  in  regard 
to  the  basis  for  the  membrane  selectivity  noted  is  in  order. 

Regardless  of  how  thin  a  membrane  may  be,  it  can  be  regarded  as  offer- 
ing a  stationary  structure  with  interstices  or  passages  through  which  the 
molecules  must  diffuse.  These  passages  will  not  all  be  of  the  same  size. 
Thus  a  cross  section  might  appear  to  be  a  mosaic  of  different  sized  holes 
as  shown  in  Fig.  7.  Some  of  the  holes  will  be  smaller  than  the  molecular 


Fig.  7.  Schematic  drawing  of  a  hypothetical  cross  section  of  a  membrane. 

dimensions  of  the  solutes  trying  to  diffuse  through  them  and  thus  a  small 
solute  would  have  a  larger  total  cross-sectional  free  area  through  which  to 
diffuse  than  a  larger  one  would  have.  This  differential  could  be  considerably 
enhanced  as  the  point  of  total  exclusion  of  passage  is  approached.  If  this 
simple  mechanistic  theory  is  correct,  the  selectivity  of  a  membrane  could 
be  a  function  of  the  size  distribution  of  the  passages  through  which  the 
solutes  must  diffuse. 

The  simple  theory  briefly  outhned  is  not  new.  It  has  been  considered 
before  in  the  extensive  studies  dealing  with  the  passage  of  solutes  through 
membranes.  No  attempt  will  be  made  here  to  discuss  this  literature.  A  recent 
excellent  review^^  dealing  with  capillary  permeability  covers  the  subject, 
including  present  information  about  the  pore  size  of  certain  membranes, 
the  relation  of  this  to  the  rate  of  passage  of  solutes  with  known  effective 
diameters,  etc.  The  factors  controlling  restricted  diffusion^*  have  been  treated 
in  a  mathematical  way. 

In  spite  of  these  studies,  Spandau^^'^^  has  presented  data  to  show  that 
dialysis  with  many  smaller  solutes  can  be  performed  in  such  a  way  that  the 
rate  seems  to  correlate  better  with  the  inverse  of  the  square  root  of  the  mole- 
cular weight  than  it  does  with  molecular  volume.  Our  experience  with  the 
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polypeptides  and  proteins  indicates  that  both  the  molecular  weight  and  the 
shape  must  be  considered. 

If  the  molecular  weight  is  known  then  dialysis  can  be  a  useful  tool  for 
studying  the  shape  of  the  molecule  in  a  particular  solution.  Something  about 
the  change  in  shape  brought  about  by  a  change  in  environment  can  be  re- 
vealed. Information  about  the  state  of  aggregation  also  can  be  derived. 
Conversely,  if  the  shape  of  the  molecule  is  known  a  good  estimate  of  the 
molecular  weight  can  be  made. 

Finally,  dialysis  as  outlined  here  can  be  a  very  useful  separation  tool 
which  at  the  same  time  offers  a  degree  of  characterization  with  respect  to 
size  homogeneity. 
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Quantitative  Papierchromatographie  der 
Aminosäuren  durch  Isotopenverdünnung 

F.  TURBA 

Physiologisch-chemisches  Institut  der  Universität  München 


Eine  Verbindung  mit  'anormalem'  Isotopengehalt  wird  durch  die  üblichen 
physikalisch-chemischen  Methoden  des  Laboratoriums  nicht  von  ihrem  'nor- 
malen' Analogen  getrennt  (Hevesy^).  Setzt  man  z.B.  einem  Proteinhydrolysat 
mit  einem  bestimmten  Gehalt  an  Glutaminsäure  eine  definierte  Menge  W^- 
Glutaminsäure  zu,  so  läßt  sich  aus  der  eingetretenen  'Verdünnung'  des 
Isotops  auch  in  einer  nicht  quantitativ  isolierten,  aber  reinen  Probe  der 
Glutaminsäuregehalt  des  Hydrolysats  berechnen  ('Isotopenverdünnungs- 
methode'  nach  Rittenberg^).  Man  kann  die  zu  bestimmende  Verbindung 
auch  mit  einem  isotop  markierten  Rest  substituieren  und  einen  Überschuß 
des  unmarkierten  Derivats  zusetzen  ('Isotopenderivatmethode'  nach  Keston, 
Udenfriend  u.  Mitarbeitern.^'^ '^  Wir  haben  Versuche  unternommen,  das 
Isotopenverdünnungsverfahren  an  C^^-markierten  Aminosäuren  mit  der 
Papierchromatographie  zu  kombinieren,  um  diese  Methode  dadurch  quanti- 
tativ zu  gestalten,^ 

BESCHREIBUNG    DER    METHODE 

0,5-1,0  mg  Protein  werden  in  5  ml  glasdestilherter,  20%iger  Salzsäure  in 
einem  Reagensglas  (mit  Schliff)  gelöst  und  die  C^^-Analogen  der  zu  bestim- 
menden Aminosäuren  in  solcher  spezifischer  Aktivität  zugesetzt,  daß  das 
Mengenverhältnis  inaktive:  aktiver  Aminosäure  etwa  10:1  und  die  meßbare 
Aktivität  etwa  1000-4000  Impulse/min  beträgt.  Man  verschUeßt  die  Gläser 
mit  Schhff"stopfen  und  hydrolysiert  mindestens  24  Stunden  bei  110°  im 
Aluminiumblock.  Danach  werden  die  Reagensgläser  direkt  an  einen  Rota- 
tionsverdampfer angeschlossen,  die  Lösung  im  Vakuum  mehrmals  zur 
Trockne  gebracht  und  genau  neutralisiert.  Die  2-dimensionale  Papier- 
chromatographie erfolgt  wie  üblich  (z.B.:  1.  Dimension:  sec.-Butanol/ Amei- 
sensäure/Wasser 70  :  15  :  15;  2.  Dimension:  Methyläthylketon/Pyridin/ 
Wasser  70  :  15  :  15).'  Man  lokahsiert  die  zu  bestimmenden  Aminosäuren 
im  Chromatogramm  durch  Radio-autographie  (24-48  Std.  Aufpressen  auf 
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Röntgenpapier)  und  schneidet  danach  die  betreffenden  Stellen  des  Chromato- 
gramms  so  aus,  daß  bei  eventuell  nicht  vollständiger  Trennung  nur  der  reine 
Anteil  der  Aminosäure  zur  Elution  gelangt.  Als  Kontrollen  werden  Aus- 
schnitte aminosäurefreier  Stellen  des  Chromatogramms  gleicher  Größe  nach 
Trocknung  über  Phosphorpentoxyd  bei  0, 1  mm  und  50°  für  1 2  Std.  (zur  Entfer- 
nung von  Ammoniak  und  flüchtigen  Ammoniumsalzen)  ebenfalls  mit  5  ml 
Wasser  (in  Iml-Portionen)  in  einer  Mikroglasfritte  eluiert.  Die  wenn  nötig 
neutralisierten  Eluate  (Vermeidung  des  Ausfällens  von  Desoxycholsäure, 
s.u  !)  werden  auf  die  Meßschälchen  überführt. 

Messung  der  Radioaktivität.  Es  erwies  sich  als  nahezu  unmöglich,  Amino- 
säuren ohne  spreitenden  Zusatz  und  auf  Schälchen  mit  ebenem  Boden  in 
gleichmäßiger  Schicht  aufzutragen,  wie  das  für  reproduzierbare  Messungen 
unentbehrlich  ist.  Am  günstigsten  erwiesen  sich  V2A-Schälchen  mit  gewölb- 
tem Boden  (Abb.  1).  Wir  fanden  weiter,  daß  bei  Zusatz  von  200  /Lig  Natrium- 


Abb.  1.  V2A-Schälchen  mit  schwach  gewölbtem  Boden  und  Rand  zur  Messung  der 
Ci* — Aktivität  von  Aminosäuren.  Die  Schälchen  wurden  zwischen  einer  Matrix  und 
einem  Stempel  mit  einheitlichen  Maßen  gepresst.  Maße  in  mm. 

Desoxycholat  zur  Probe  das  Auftragen  in  Form  eines  gleichmässig  dünnen 
Films  gehngt.  Außerdem  wird  durch  diesen  Zusatz  erreicht,  daß  die  Än- 
derung der  Selbstabsorption,  bedingt  durch  die  verschiedenen  Mengen  an 
Aminosäuren,  bei  Vorgabe  der  relativ  großen  Menge  an  organischer  Sub- 
stanz keine  Rolle  mehr  spielt  (Abb.  2).  Die  Messung  als  C^^Og  im  Bernstein- 


20  40  60      y/2-8cnn2 

Abb.  2.  Einfluß  der  angewandten  Menge  der  Aminosäuren  in  /^g/cmo  auf  die  Selbst- 
absorption von  C^*.  200  ^g  Na-Desoxycholat  wurden  jeder  Probe  hinzugefügt. 


Balientine-Zählrohr^-^  oder  im  dynamischen  Elektrometer  ist  umständlicher, 
da  sie  eine  Veraschung^*'  der  Substanz  voraussetzt.  Die  Fällung  als  BaC^^'Os 
hat  sich  bei  den  vorhegenden  kleinen  Mengen  nicht  bewährt.  Außerdem  hat 
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die  Messung  der  C^^-Aminosäuren  als  solche  den  Vorteil,  daß  man  die 
gesamte,  ungeteilte  Probe  anschließend  zur  Konzentrationsbestimmung  mit 
Ninhydrin  verwenden  kann,  was  die  Genauigkeit  erhöht;  die  Desoxychol- 
säure  stört  dabei  nicht. 

Messung  der  Konzentration.  Für  die  Konzentrationsbestimmung  der  Amino- 
säuren kam  aus  Gründen  der  Empfindlichkeit  und  Genauigkeit  nur  die 
kolorimetrische  Ninhydrinmethode  in  Frage;  sie  erfolgte  nach  der  Vorschrift 
von  Moore  und  Stein, ^^  aber  im  evakuierten  System  (Van  Slyke^^)  zur 
Verhinderung  des  Entweichens  von  C^^02  (vgl.  Abb.  3).  Nach  Entwicklung 
der  Farbe  (20  min,  100°)  wurde  10  min.  Luft  durch  das  Reaktionsgefäß  in 
Barytwasser  gesaugt. 


X  ^ 


X 


Abb.  3.  Reagensglas  mit  SchlifFstopfen  und  kapillaren  Ansätzen  zur  kolorimetrischen 
Ninhydrinbestimmung  von  Ci*-Aminosäuren.  (o)  Stellung  des  Stopfens  zum  Evakuieren 
vor  dem  Erhitzen,  {b)  Einsaugen  von  Luft  nach  Abkühlen  der  Analysenprobe,  (c)  Durch- 
leiten von  Luft  zur  Entfernung  von  C^*02.  {d)  Verschluß  des  evakuierten  Röhrchens 
während  der  Farbentwicklung. 


^ 


Berechnung.  Diese  erfolgte  nach  a=b 


ft°-) 


(a= inaktive  Aminosäure  in 


)Lig;  b= zugesetzte  aktive  Aminosäure  in  /ig;  Co= Impulse/min:  ixg  für  die 
zugesetzte  aktive  Aminosäure;  C  =  Impulse/min  :  jug  für  die  isolierte 
Analysenprobe).  Zur  Genauigkeit  und  Fehlerberechnung  vgl,  Rittenberg^ 
bzw.  RiedeP^.  Unter  günstigen  Voraussetzungen  (Cq  :  C^IO;  gemessene 
Aktivität  1000-4000  Impulse/min)  kann  man  einen  Fehler  von  2-3%  erwarten 
(reiner  Meßfehler  der  Aktivitätsmessung  <1%,^*  Gesamtfehler  der 
Aktivitätsbestimmung  2-3  %  ;  Fehler  der  Konzentrationsbestimmung  <  1  %). 
Vgl.  Tab.  1. 

DISKUSSION 

Der  Hauptvorteil  der  Methode  beruht  darin,  daß  an  keiner  Stelle  des  Arbeits- 
gangs quantitative  Ausbeuten  erforderlich  sind  (selbst  die  Übertragung  der 
Substanz  vom  Meßschälchen  nach  der  Bestimmung  der  C^*-Aktivität  zur 
kolorimetrischen  Bestimmung  muß  nicht  vollständig  sein,  wenn  man  den 
verbleibenden  Rest  der  Aktivität  zurückmißt).  Damit  wird  das  Verfahren 
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Tab.  1 
BESTIMMUNG    VON    AMINOSÄUREN    IM   GEMISCH   DURCH   ISOTOPEN- 
VERDÜNNUNG     MIT      C14-AMIN0SÄUREN      UND       PAPIERCHROMATO- 
GRAPHIE 


Hydrolysat 

ein- 
gesetzte 
Menge 
Protein 

Markierte 
Aminosäure 

Iso- 
lierte 
Probe 
C 

J/min: 
Mg 

a 
Inak- 
tive 
Amino- 
säure 

Mg(% 
d.Th) 

gefundene 
g  Amino- 
säure/100 
g  Protein 

Litera- 
turan- 
gabent 

Markie- 
rungs- 
ort 

b 

Mg 

Co 

J/min  : 
Mg 

Künstl. 
Gemisch* 

69,6  /xg 
(Alanin) 

Alanin 

7,5 

2666/7,5 

1252/37 

70,8 
(101%) 

— 

— 

jS-Lacto- 
globulin 

720 /Lig 

Alanin 

7,5 

2666/7,5 

1500/31 

51,3 

7,12 

6,7;î 
7,11% 

^-Lacto- 
globulin 

720 /ig 

l-C'*- 

Phenyl- 
alanin 

3,2 

3026/3,2 

1250/11,4 

27,8 

3,86 

3,78î 

*  Das  Künstliche  Gemisch  war  einem  Hydrolysat  nachgebildet. 

t  Wahrscheinlichster  Wert  fettgedruckt. 

Î  G.  R.  Tristram,  Advances  in  Protein  Chemistry,  5  (1949). 

§  A.  S.  Keston.  S.  Udenfriend  u.  R.  K.  Cannan,  /.  Am.  Chem.  Soc,  68,  1390  (1946). 

unabhängig  von  den  Fehlerquellen  anderer  Analysenmethoden.  Bei  der 
Hydrolyse  werden  manche  Aminosäuren  teilweise  zerstört  und  entgehen 
daher  normalerweise  z.Tl.  der  Bestimmung,  bes.  bei  Hydrolyse  in  Gegen- 
wart von  Zucker;  von  Glyko-;  Lipo-  und  Chromoproteiden ;  von  Extrakten 
aus  biologischem  Material).  Bei  der  Isotopenverdünnung  ist  dies  nicht  der 
Fall  (Tab.  2). 

Tab.  2 

VERLUSTE  BEI  DER   ANALYSE  VON   METHIONEN   NACH    ERHITZEN 
IN    20%    HCl    IN    GEGENWART    VON    GLUCOSE 

42,2  /xg  Methionin,  80,0  Mg  Glucose,  2  ml  20  %ige  glasdest.  HCl,  24  Std.,  100°.  Chromato- 
graphie eindimensional  in  sec-Butanol/Ameisensäure/Wasser. 


Ci^Hg-Methionin 

zugesetzt  zu 
inakt.  Methionin 

b 

Co 

I/min:  Mg 

C 

I/min:  /xg 

a 

Ausbeute 
%d.Th. 

Verlust  bei 

vor  Erhitzen  mit 
Säure 

8.4  Mg 

4430/8.4 

2900/33.8 

43,2 

102,5 

— 

nach        „        „ 

8.4  Mg 

4430/8.4 

2620/14.2 

15,6 

36,2 

'Hydrolyse' 

nach  Elution  aus 
Chromat. 

8.4  Mg 

4430/8.4 

4156/15.2 

7,8 

18.5 

Chromato- 
graphie 
Elution 
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Tab.  2  zeigt  das  Verhalten  von  Methionin  bei  Erhitzen  in  Gegenwart  von 
Glucose  mit  20%  HCl.  Setzt  man  C^^-markiertes  Methionin  vor  der  'Hydro- 
lyse' zu,  so  findet  man  die  Aminosäure  quantitativ  wieder,  da  die  Zerstörung 
das  inaktive  wie  das  aktive  Methionin  gleichermaßen  betrifft.  Bei  Zusatz 
nach  'Hydrolyse'  dagegen  sind  nur  36,2%,  nach  der  Elution  aus  dem  Chroma- 
togramm  nur  noch  18,5%  bestimmbar  (zu  Verlusten  beim  Chromato- 
graphieren in  versch.  Lösungsmitteln,  bes.  Phenol,  Trocknen  bei  versch. 
Temperaturen,  durch  Lichteinwirkung  usw.  vgL^^-^^) 

Auch  in  normalen  Hydrolysaten  erfassen  wir  nur  40-60%  der  eingesetzten 
Mengen  bei  der  Konzentrationsbestimraung  am  Schluß  des  Analysenwegs 
in  Substanz  (vgl.  Tab.  1). 

Die  zur  Kompensation  der  bei  der  Papierchromatographie  eintretenden 
Verluste  von  Fischer  u.  Dörfel"  vorgeschlagene  Methode,  zu  jeder  Analysen- 
probe ein  auch  quantitativ  möglichst  identisches  künstliches  Aminosäure- 
gemisch unter  genau  gleichen  Bedingungen  mitzuanalysieren  und  die  Ana- 
lysenwerte auf  seine  Ausbeuten  zu  beziehen,  ist  wegen  der  notwendigen 
Vorversuche  langwierig  und  weniger  genau  als  die  Isotopenverdünnung.  Ein 
weiterer  wesentlicher  Vorteil  des  hier  beschriebenen  Verfahrens  besteht  in 
der  genauen  und  verlustlosen  Markierung  der  Lage  der  Aminosäuren  im 
Chromatogramm  durch  Radioautographie.  Diese  Tatsache  ermöglicht  die 
Anwendung  der  wirksameren  2-dimensionalen  Papierchromatographie  ohne 
teilweises  Zerstören  der  Aminosäuren  durch  Besprühen  mit  farbbildenden 
Reagentien,^^-"  Erzeugung  von  Fluoreszenz  durch  Erhitzen-"  usw.  Auch 
gelingt  die  Trennung  aller  Aminosäuren  im  gleichen  Chromatogramm,  wäh- 
rend beim  eindimensionalen  Verfahren  stets  mehrere  Versuche  mit  ver- 
schiedenen Lösungsmitteln  notwendig  sind  (McFarren).  Schliesslich  verdient 
die  quantitative  Bestimmung  in  Lösung  nach  Elution  den  Vorzug  vor  der 
Kolorimetrie  im  Papier  z.B.  nach  Besprühen  mit  Ninhydrin.^^ 

Vor  den  beschriebenen  Isotopenderivatmethoden  (mit  J^^^  und  S^^-p- 
Jodphenylsulfonyl-Aminosäuren)  hat  das  entwickelte  Verfahren  den  Vorteil 
besserer  Trennungen  bei  der  Papierchromatographie  und  der  dauernden 
Verwendbarkeit  der  markierten  Substanzen  infolge  der  hohen  Halbwertszeit 
von  C^*,  vor  den  N^^- Verfahren  der  einfacheren  Messung  des  radioaktiven 
Isotops  und  der  Benutzung  kleinerer  Mengen,  wie  sie  die  Papierchromato- 
graphie erfordert. 

Die  einzig  ausschlaggebende  Fehlerquelle  des  Verfahrens  ist  die  Verun- 
reinigung der  Analysensubstanz  mit  Ninhydrin-positivem  Material,  das  die 
Ergebnisse  fälschen  würde.  Da  jedes  Chromatographiepapier  auch  nach 
Auswaschen  noch  geringe  Mengen  Verunreinigungen  enthält,  die  mit  Nin- 
hydrin  reagieren  (vgl.  Abb.  4),  ist  es  notwendig,  von  den  Analysenwerten 
entsprechende  kleine  Korrekturen  in  Abzug  zu  bringen.  Die  Benutzung  von 
reinem  Glasfaserpapier  zur  Chromatographie,  das  frei  von  diesem  Fehler 
ist,  setzt  das  Imprägnieren  mit  Puffersalzen  voraus;  die  C-^'*Messung  kann 
danach  nur  noch  als  C^^Og  erfolgen.  Das  Gleiche  gilt  für  die  Anwendung 
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dieses  Verfahrens  auf  Eluate  mit  einem  Gehalt  an  nichtsublimierbarcn  Salzen 
aus  lonenaustauschersäulen.  Ammoniak  und  Ammoniumsalze  werden  im 
Vakuum  entfernt  (s.o.). 


\>c 


Abb.  4.  Auswaschen  von   Chromatographiepapier.   300  cm2   Schleicher-Schüll-Papier 
2043  b  wurden  im  Durchlaufverfahren  mit  Wasser,  anschliessend  (beginnend  bei  |  )  mit 

5%  Trichloressigsäure,  schliesslich  (beginnend  bei  1  )  mit  5%  methanolischer  KOH 
eluiert.  Je  10  ml  des  Eluats  wurden  im  Vakuum  eingedampft  und  nach  Ausäthern  bzw. 
Neutralisation  die  Ninhydrinbestimmung  nach  Moore  und  Stein  durchgeführt,  (a)  Null- 
wert der  Reagentien;  (b)  unhydrolysiertes  Eluat;  (c)  hydrolysiertes  Eluat  (24  Std.,  110", 
20%  HCl).  Die  Extinktion  bei  570  m/x  von  1,0  entspricht  37,0  ng  Alanin. 


ANWENDBARKEIT 

Das  Verfahren  hat  Bedeutung  für  die  Bestimmung  der  genauen  Zahl 
bestimmter  Aminosäure- Reste'^'^  in  gr<)ßeren  Peptiden  und  in  Proteinen.  Es 
ist  geeignet,  Verluste  bei  der  Isolierung  von  Aminosäuren  in  biologischer 
Material  (z.B  durch  Hydrolyse,  Enteiweissung,  Entsalzung  usw.)  aus- 
zuschalten. Man  darf  erwarten,  daß  mit  seiner  Hilfe  die  Genauigkeit  von 
Bestimmungen  der  Amino-  und  Carboxylendgruppen  in  Proteinen,  die  z.Zt. 
mit  Fehlern  von  teilweise  50%  und  mehr  belastet  sind^^  und  umständlicher 
(gelegentlich  auch  unsicherer)  Korrekturen  bedürfen,  ähnlich  gesteigert 
werden  kann,  wie  bei  der  Bestimmung  von  Aminosäuren  in  Hydrolysaten; 
wir  sind  mit  solchen  Versuchen  beschäftigt. 
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SPECIFIC  PROTEINS 

Haemoglobin  and  Myoglobin 


Crystallographic  studies  of  myoglobin 

J.  C.  KENDREW 

Medical  Research  Council  Unit  for  Molecular  Biology, 
Cavendish  Laboratory,  University  of  Cambridge 


This  paper  is  an  outline  of  recent  developments  in  X-ray  studies  of  the 
structure  of  myoglobin.  These  studies  have  been  undertaken  in  our  labora- 
tory and  in  the  Royal  Institution  at  London  with  a  number  of  colleagues, 
past  and  present,  including  M.  M.  Bluhm,  G.  Bodo,  H.  M.  Dintzis,  V.  M. 
Ingram,  J.  Kraut,  R.  G.  Parrish,  P.  J.  Pauling,  D.  C,  Phillips,  Mary  Pinkerton, 
Helen  Scouloudi,  and  H.  Wyckoff.  Several  similar  projects  are  under  way 
in  other  laboratories,  all  having  the  object  of  determining  the  structure  of 
a  globular  protein  purely  by  X-ray  methods;  they  are  concerned  with  a 
variety  of  proteins,  notably  haemoglobin,  insulin,  and  ribonuclease.  Although 
the  methods  they  use,  and  the  stage  they  have  reached,  vary  from  project 
to  project,  the  myoglobin  programme  may  be  taken  as  typical  of  them  all. 
Until  four  years  ago  no  one  knew,  even  in  principle,  how  structures  as  com- 
plex as  proteins  could  be  solved;  but  Perutz's  successful  application  of  the 
so-called  method  of  isomorphous  replacement  to  haemoglobin^  resolved  a 
deadlock  in  the  subject  by  indicating  a  secure,  though  difficult,  line  of  ad- 
vance. Since  then  the  potentialities  of  this  method  in  protein  crystallography 
have  been  explored,  and  it  has  been  used  to  obtain  several  two-dimensional 
electron  density  projections  of  protein  crystals  (haemoglobin  of  horse-  and 
ox;^  myoglobin  of  sperm  whale"*  and  seal;^  ribonuclease).^-'^  Gradually  it 
has  become  clear  that  the  only  way  of  exploiting  the  new  method  usefully 
is  to  extend  it  to  three  dimensions  :  for  a  three-dimensional  electron  density 
map,  whatever  its  limitations  as  to  resolution  and  accuracy,  would  at  least 
completely  avoid  the  overlapping  of  atoms  which  makes  projections  of  com- 
plex structures  (including  all  those  just  enumerated)  impossible  to  inter- 
pret. In  most  of  the  crystalline  protein  projects  the  possibility  of  extending 
the  analysis  to  three  dimensions  is  being  explored  at  the  present  time.  This 
paper  will  show  that  at  any  rate  in  myoglobin  such  extension  is  possible  in 
principle,  although  we  have  not  yet  collected  all  the  necessary  experimental 
data,  nor  are  we  yet  certain  what  resolving  power  will  be  needed  to  reveal 
recognizable  features  of  the  structure. 


126  J.  C.  KENDREW  [9 

THE  BASIC  CRYSTALLOGRAPHIC  PROBLEM 

It  is  well  known  that  a  three-dimensional  Fourier  summation,  carried  out 
with  the  amplitudes  and  phases  of  all  the  reflexions  in  the  X-ray  pattern 
of  a  crystal  as  terms,  gives  a  representation  of  the  electron  density  in  the 
unit  cell  of  the  crystal  ;  and  that  the  degree  of  resolution  obtained  depends 
on  the  Bragg  spacing  of  the  highest-order  reflexions  included  in  the  summa- 
tion, the  number  of  reflexions  involved  being  proportional  to  the  cube  of 
the  spacing  (e.g.  in  myoglobin  about  400  for  6  Â.,  25,600  for  1  -5  Â.).  Further- 
more, if  particular  sets  of  reflexions  (known  as  'zones')  are  selected  and  their 
amplitudes  and  phases  used  in  a  ?H'o-dimensional  Fourier  synthesis,  the  result 
is  a  representation  of  the  electron  density  projected  along  a  particular  crys- 
stallographic  direction  onto  a  plane.  The  amplitudes  of  the  reflexions  are 
the  square  roots  of  the  observed  intensities  and  are  therefore  measurable 
by  photographic  or  other  means  ;  but  the  phases  are  not  directly  observable, 
and  in  this  consists  the  basic  difficulty  of  X-ray  analysis — that  experiment 
gives  one  just  half  the  information  required  to  solve  a  structure.  The  history 
of  the  subject  is  largely  a  history  of  attempts  to  discover  ways  to  get  round 
this  difficulty.  In  simple  structures  straightforward  trial-and-error  methods, 
or  sometimes  what  are  in  effect  sophisticated  variants  of  them,  have  often 
sufficed;  but  in  structures  as  complex  as  the  proteins  there  is  little  hope 
that  these  will  succeed,  and  indeed  in  the  past  they  provided  only  too  much 
free  play  for  the  exercise  of  speculative  theories  and  models  of  protein 
structure,  without  the  possibiUty  of  proving  whether  any  of  the  specula- 
tions were  wrong  or  right.  The  method  of  isomorphous  replacement  (and 
its  near  relative  the  heavy-atom  method)  was  also  developed  for  simple 
structures,  and,  where  chemically  practicable,  it  does  often  lead  to  an  un- 
equivocal solution  of  the  structure.  It  was  the  successful  application  of  this 
method  to  proteins  which  first  put  the  structure  analysis  of  their  crystals 
onto  a  firm  basis  and  it  is  now  the  principal  tool  in  all  laboratories  studying 
protein  crystal  structure.  To  illustrate  its  use  we  shall  describe  its  applica- 
tion to  myoglobin,  first  of  all  in  two  dimensions. 

The  reason  for  limiting  the  investigation  to  two  dimensions  in  the  first  instance, 
is  simply  that  in  most  crystals  the  phases  of  certain  reflexions  are  restricted  by 
the  symmetry  of  the  crystal  to  values  of  0  or  tt  ;  or,  in  other  words,  these  so-called 
real  reflexions  may  be  regarded  simply  as  having  signs,  either  positive  or  negative 
(unlike  the  generality  of  reflexions  whose  phases  may  assume  any  values  from 
0  to  Itt).  Thus  the  question  what  phase  is  possessed  by  a  reflexion  of  this  kind  can 
have  only  two  answers,  '  +  '  or  '  —  ',  whereas  to  reflexions  in  general  no  such  restric- 
tion applies.  It  is  consequently  much  easier  to  determine  the  phases  of  real  re- 
flexions; and  the  real  reflexions  from  a  given  crystal  fall  into  a  group  or  groups 
(the  real  zone  or  zones  of  reflexions)  such  that  each  zone,  when  it  is  used  as  terms 
of  a  Fourier  synthesis,  gives  a  projection  along  a  particular  crystal  axis  onto  a 
plane,  i.e.  a  two-dimensional  synthesis. 
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MYOGLOBIN    AND    THE    ISOMORPHOUS 
REPLACEMENT    METHOD    IN    TWO    DIMENSIONS 

Let  us  assume  that  an  isomorphous  heavy-atom  derivative  of  crystaUine 
myoglobin  has  been  prepared,  containing,  for  example,  one  mole  of  mercuri- 
iodide  ion  per  mole  of  protein.  This  implies  that  one  mercuri-iodide  group 
is  specifically  attached  to  a  single  site  on  each  myoglobin  molecule,  and 
that  the  resulting  complex  has  crystalhzed  in  precisely  the  same  arrange- 
ment as  normal  unsubstituted  myoglobin,  the  cell  dimensions  and  packing 
arrangement  of  the  molecules  remaining  unchanged.  We  shall  return  later 
to  the  question  how  such  derivatives  can  be  prepared,  and  assume  for  the 
moment  simply  that  it  has  been  done. 

Owing  to  the  presence  of  the  Hgl4-^  group  the  intensities  of  all  the  re- 
flexions in  the  X-ray  pattern  will  be  changed.  Even  as  weighty  an  ion  as 
mercuri-iodide  is  very  small  compared  with  the  whole  protein  molecule, 
however,  and  at  first  sight  one  might  guess  that  the  changes  in  the  diff'racted 
intensities  would  be  too  small  to  be  measured.  It  was  Perutz's  contribution 
to  realize,  in  the  course  of  his  studies  of  haemoglobin,  that  the  changes  will 
in  reality  be  very  pronounced,  in  consequence  of  the  low  average  intensity 
of  the  reflexions  from  a  unit  cell  so  uniformly  filled  with  matter  as  is  that 
of  a  protein  ;  the  average  contribution  of  a  single  specifically-situated  mer- 
cury atom  is  actually  about  one  quarter  of  the  average  contribution  of  the 
myoglobin  molecule  as  a  whole. 

The  first  step  in  the  analysis  is  to  measure  the  intensities  of  all  the  real 
reflexions  before  and  after  the  attachment  of  mercuri-iodide.  The  square 
roots  of  these  intensities  are  the  moduli  of  the  amphtudes,  |  F  |  and  |  F'  | . 
From  them  are  calculated  the  so-called  'difference  intensities'  (|Fl  —  |F'|),2 
and  these  are  used  as  terms  of  a  Fourier  synthesis.  A  Fourier  synthesis  using 
the  intensities  instead  of  the  amplitudes  of  the  reflexions  from  a  crystal  is 
known  as  a  vector  synthesis,  and  Patterson  showed  many  years  ago  that 
its  physical  significance  is  to  give  a  representation  of  all  the  interatomic 
vectors  in  the  structure,  transferred  to  a  common  origin  without  regard  to 
the  absolute  co-ordinates  of  the  atoms  at  which  each  vector  terminates.  If 
there  are  n  atoms  per  unit  cell  there  are  «^  interatomic  vectors,  so  the  result- 
ing vector  map  is  usually  hopelessly  confused  if  the  structure  is  as  complex 
as  that  of  a  protein.  By  the  subtractive  procedure  we  have  just  outlined, 
however,  we  virtually  cancel  out  all  the  atoms  of  the  protein  and  leave  in 
effect  an  empty  unit  cell  containing  only  the  heavy  atoms;  the  vector  syn- 
thesis simply  shows  the  vectors  between  these  atoms  (together  with  a  high 
peak  at  the  origin  representing  vectors  of  zero  length  between  each  atom 
and  itself).  The  result  is  shown  in  Fig.  1.  The  origin  is  in  the  centre;  the 
unit  cell  contains  two  myoglobin  molecules  related  by  a  screw  dyad  axis 
of  symmetry,  and  consequently  two  mercury  atoms  A  and  B.  The  vectors 
AB  and  BA  between  these  atoms  are  symmetrically  disposed  about  the 
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origin,  and  their  positions,  together  with  the  known  symmetry  properties 
of  the  crystal,  enable  us  to  locate  the  heavy  atoms  in  the  structure,  at  least 
as  far  as  their  x  and  z  co-ordinates  are  concerned  (the  projection  shown  is 
made  along  the  y  axis,  and  is  the  only  one  which  can  be  calculated  from  real 
reflexions  in  the  monoclinic  space  group  to  which  this  crystal  belongs). 
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Fig.  1.  (a)  Difference- vector  synthesis  of  the  mercuri-iodide  complex  of  myoglobin 
(type  A,  y  projection),  {b)  Schematic  representation  of  vectors  between  heavy  groups, 
(c)  Positions  of  mercuri-iodide  groups  in  the  unit  cell.  The  unit  cell  contains  two  myo- 
globin molecules  and  therefore  two  mercuri-iodide  groups. 

The  next  step  is  to  compute  the  contributions — positive  or  negative — 
which  a  heavy  atom  in  the  position  assigned  would  make  to  each  reflexion. 
The  signs  of  these  contributions,  which  are  calculated  in  a  straightforward 
and  unambiguous  manner,  make  it  possible  to  allot  signs  to  the  protein 
reflexions  by  inspection.  For  if  a  protein  reflexion  is  positive  and  the  con- 
tribution of  the  heavy  atom  is  likewise  positive,  the  resultant  is  a  stronger 
reflexion;  the  reflexion  is  also  strengthened  if  a  negative  heavy  atom  con- 
tribution is  added  to  a  negative  protein  term  ;  on  the  other  hand  a  positive 
heavy  atom  contribution  together  with  a  negative  protein  contribution,  or 
vice  versa,  leads  to  a  weakening  of  the  reflexion. 

The  amplitudes  of  the  protein  reflexions  having  been  measured  and  their 
signs  determined,  the  way  is  clear  to  compute  a  Fourier  synthesis,  which  is 
a  representation  of  the  electron  density  in  the  unit  cell  projected  along  the 
y  axis.  Fig.  2  shows  the  result.  It  is  computed  with  a  resolution  of  6  Â.  : 
we  are,  however,  projecting  some  31  Â,  thickness  of  protein  and  mother 
liquor  onto  each  point  (31  Â.  is  the  length  of  the  y  axis  along  which  the 
projection  is  made).  It  is  hardly  surprising,  therefore,  that  the  various  fea- 
tures of  the  structure  overlie  one  another  in  such  confusion  that  in  the 
resulting  projection  nothing  can  be  disentangled.  Increasing  the  resolution 
does  not  help  at  all;  the  same  projection  computed  with  a  resolution  of 
4  Â.  is  almost  identical  and  quite  as  unintelligible.  Of  the  correctness  of  the 
projection  there  is  no  doubt:  for  the  whole  process  of  sign  determination 
can  be  repeated  with  a  different  isomorphous  substitution  in  the  same  unit 
cell,  using,  for  example,  a  gold  atom  which  attaches  itself  to  a  different 
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position  on  the  protein  molecule,  and  the  signs  of  the  protein  reflexions  so 
derived  agree  perfectly  with  those  obtained  from  the  mercuri-iodide  sub- 
stitution. Similar  two-dimensional  projections  of  other  protein  crystals  have 
been  prepared,  as  has  already  been  mentioned,  but  in  no  case  can  the  pro- 
jection be  interpreted  in  chemical  terms.  Nevertheless  these  projections  re- 
present an  important  stage  in  the  development  of  the  subject,  since  they 
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Fig.  2.  Fourier  projection  of  Type  A  myoglobin  crystals  along  y,  including  all  terms 
of  spacing  greater  than  6-6  Â. 

are  the  first  definitely  correct  pictures  of  protein  structures  to  be  obtained. 
It  has,  however,  become  clear  that  isomorphous  replacement,  powerful 
method  of  analysis  though  it  is,  will  only  be  exploited  usefully  if  it  can  be 
extended  to  three  dimensions  in  order  to  circumvent  the  difficulty  of  overlap 
in  projections.  We  shall  consider  how  this  might  be  done  after  a  brief  dis- 
cussion of  the  chemical  problems  involved  in  attaching  heavy-atom  deriva- 
tives to  myoglobin. 


THE  ATTACHMENT  OF  HEAVY  GROUPS 
TO  MYOGLOBIN 

In  haemoglobin  the  convenient  presence  of  free  sulphydryl  groups  makes 
possible  the  use  of  any  of  the  standard  — SH  group  reagents,  such  as  p- 
chloro-mercuribenzoate  or  methyl-mercury,  for  attaching  mercury  atoms  to 
the  molecule.  Myoglobin  contains  no  free  sulphydryl  groups  in  any  species 
which  we  have  so  far  examined,  and  accordingly  less  straightforward  methods 
must  be  adopted.  The  isomorphous  replacement  method  cannot  easily  be 
applied,  however,  unless  the  heavy  atom  is  attached  specifically  to  a  single 
site  in  the  molecule,  or  at  the  most  to  a  very  few  sites.  In  myoglobin  the 
Ips 
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obvious  unique  site  is  the  single  haem  group  itself,  but  we  found  that  although 
some  of  the  usual  haem  group  reagents  (such  as  isocyanides  and  nitroso- 
compounds)  could  be  synthesized  with  heavy  atoms  in  their  molecules,  it 
was  very  difficult  attaching  them  stoicheiometrically  to  the  haem  group, 
partly  because  of  the  low  solubility  of  reagents  containing  heavy  atoms  such 
as  mercury  and  partly  on  account  of  the  very  high  affinity  of  myoglobin 
for  oxygen.  Unless  the  most  stringently  anaerobic  conditions  are  main- 
tained the  complexes  of  myoglobin  with,  for  example,  isocyanides  rapidly 
dissociate:  and  we  have  not  so  far  been  able  to  carry  through  the  whole 
process  of  preparation,  crystallization  and  X-ray  photography  without  an 
unacceptable  amount  of  decomposition. 

The  approach  which  finally  proved  successful,  and  which  was  mainly 
developed  by  Drs  Bodo  and  Dintzis,  was  to  crystallize  myoglobin  in  the 
presence  of  small  amounts  of  suitable  ions  containing  heavy  metals  (gener- 
ally one  or  two  moles  of  ion  per  mole  of  protein)  ;  the  ions  were  selected 
in  the  light  of  such  general  information  as  is  available  about  the  chemical 
affinities  of  different  protein  side-chains.  Crystals  thus  prepared  were  tested 
directly  by  means  of  X-rays,  and  not  by  chemical  methods.  If  their  diffrac- 
tion pattern  proved  to  be  identical  with  that  of  a  normal  crystal  it  was 
evident  that  no  combination  had  taken  place.  But  if  there  were  appreciable 
changes  in  the  intensities  of  the  reflexions,  it  could  be  concluded  that  attach- 
ment had  taken  place.  The  next  stage  would  be  to  see  whether  the  com- 
bination was  at  a  single  site,  and  to  locate  that  site  in  the  unit  cell;  for,  as 
already  indicated,  indiscriminate  combination  with  a  large  number  of  sites 
on  the  protein  surface — with  all  the  lysine  amino  groups,  for  example — 
would  be  useless  for  the  present  purpose.  To  test  this,  Fourier  methods  are 
used;  just  as  we  can  prepare  a  difference  vector  projection  of  the  heavy- 
atom/heavy-atom  vectors  from  the  changes  in  diffracted  intensity,  so  by 
using  the  changes  in  amplitude  as  terms  in  a  Fourier  synthesis  (once  the 
signs  of  the  reflexions  have  been  established)  we  can  in  effect  subtract  out 
the  protein  and  locate  the  positions  of  the  heavy  atoms  responsible  for  the 
changes  in  the  diffraction  pattern.  An  example  of  the  resulting  'difference- 
Fourier  projection'  is  given  in  Fig.  3  ;  this  shows  the  effect  of  crystallizing 
myoglobin  with  /?-chloro-mercuribenzene  sulphonate  (PCMBS),  and  indi- 
cates that  a  single  mercury  atom  has  been  attached  to  each  of  the  two  myo- 
globin molecules  in  the  unit  cell  (the  other  atoms  in  PCMBS  are  too  light 
to  show  up  in  the  difference-Fourier  projection).  Of  course,  PCMBS  is  a 
reagent  for  sulphydryl  groups,  and  there  was  no  a  priori  reason  to  expect 
that  it  would  combine  with  myoglobin  at  all:  indeed  we  discovered  by 
accident  that  it  does  so,  and  we  still  have  no  certain  knowledge  of  the  chem- 
istry of  the  process.  The  same  is  true  of  most  of  the  other  reagents  we  have 
investigated.  These  include  the  ions  AUCI4-,  Ag+  (in  silver  nitrate),  \{g\^^-, 
Hg(NH3)^+  (made  by  dissolving  mercuric  oxide  in  concentrated  ammonium 
sulphate),  and  AUI4-.  The  results  of  these  experiments  are  summarized  in 
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Fig.  4,  which  shows  the  points  of  attachment  (in  projection)  of  the  principal 
ligands  which  were  successfully  put  on.  A  number  of  interesting  chemical 
questions  arise;  for  example,  AUI4-  occupies  the  same  site  as  Hgl42",  sug- 
gesting that  the  central  atom,  and  even  the  charge,  may  play  only  a  sub- 
sidiary role  in  determining  the  mode  of  attachment.  Again,  Ag+  and  AuCl4~ 
are  attached  to  the  same  site,  in  spite  of  their  charges  being  apparently 


Fig.  3.  DifFerence-Fourier  projection  of /7-chloro-mercuri-benzene  sulphonate  derivative 
of  Type  A  myoglobin  along  y,  showing  positions  of  mercury  atoms. 

opposite.  However,  it  is  found  that  the  gold  atom  becomes  attached  only 
if  the  crystals  are  left  for  many  months  in  their  mother  liquor  before  X-ray 
pictures  are  taken,  suggesting  that  some  secondary  chemical  process  is  a 
necessary  preliminary  to  combination. 

There  are  other  examples  in  which  we  understand  the  chemistry  rather 
better.  Thus  both /j-iodo-nitrosobenzene  and  H03S.C6H4.Hg.S.C6H4.NC 
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Fig.  4.  Heavy-atom  complexes  of  myoglobin  type  A,  showing  the  positions  of  heavy 
atoms  in  the  unit  cell  in  y  projection. 

(PCMBS — S.C6H4.NC)  are  specific  haem  group  reagents,  and  although  it 
was  not  possible  to  use  them  for  determining  the  signs  of  the  protein  re- 
flexions, we  were  able  to  use  the  protein  signs  established  by  other  methods 
to  work  backwards  and  locate  the  heavy  atoms  in  these  reagents  with  rela- 
tion to  the  crystal  axes.  In  each  complex  the  heavy  atom  is  separated  from 
the  iron  atom  of  the  haem  group  by  the  same  number  of  intermediate  atoms, 
and  we  do  in  fact  find,  as  Fig.  4  shows,  that  they  are  in  almost  the  same 
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position  in  the  cell.  We  conclude  that  the  iron  atom  of  the  haem  group  must 
be  somewhere  on  the  surface  of  a  sphere  with  centre  at  the  heavy-atom  posi- 
tion and  radius  of  some  6  A.  We  have  thus  succeeded  in  approximately 
locating  the  haem  group  with  respect  to  the  axes  of  the  unit  cell. 

At  the  other  extreme  from  the  specific  haem-group  reagents  are  some 
ions  for  which,  the  evidence  suggests,  combination  is  interstitial  rather  than 
truly  chemical;  that  is  to  say,  the  ion  lodges  in  a  convenient  niche  on  the 
protein  surface  or  perhaps  in  a  'cosy  corner'  between  two  neighbouring 
protein  molecules.  In  such  cases  the  ion  may  combine  specifically  with  only 
one  particular  crystal  form,  any  other  form  (prepared  by  different  methods, 
and  having  the  myoglobin  molecules  packed  together  in  a  different  manner) 
either  giving  no  combination  at  all,  or  else  perhaps  combination  at  several 
different  sites.  In  general  it  will  be  evident  that  we  are  very  ignorant  of  the 
way  in  which  the  heavy  groups  we  use  are  attached  to  the  protein.  In  most 
cases  it  might  be  quite  laborious  to  discover  by  conventional  chemical 
methods  what  mechanism  is  involved,  and  it  is  interesting  that  we  are  at 
present  able  to  locate  the  heavy  atoms  far  more  easily  by  X-rays  than  by 
chemical  methods,  and  this  with  certainty  although  the  structure  of  the 
protein  as  a  whole  is  still  unknown.  Even  dynamic  effects  may  be  observed. 
Thus  on  prolonged  irradiation  of  a  crystal  of  the  PCMBS  complex  with 
X-rays  the  mercury  atom  migrates  from  its  normal  position  in  the  unit  cell 
to  another  one,  which  turns  out  (in  projection  at  least)  to  be  identical  with 
the  site  occupied  by  the  heavy  atom  in  the  silver  and  gold  complexes! — 
the  difference-Fourier  projection  calculated  from  the  amplitudes  before  and 
after  irradiation  is  found  to  contain  a  region  of  negative  electron  density 
at  the  former  site  and  a  positive  peak  at  the  latter.  All  in  all,  methods  for 
attaching  heavy  groups  to  proteins  might  well  repay  study  by  conventional 
chemical  techniques,  and  certainly  protein  crystallographers  would  be  glad 
to  have  them  investigated  in  the  expectation  that  analogous  and  perhaps 
better  reagents  might  be  discovered. 

THREE-DIMENSIONAL    METHODS 

In  this  final  section  we  shall  consider  what  is  involved  in  making  a  Fourier 
synthesis  of  a  protein  crystal  in  three  dimensions.  All  the  reflexions  whose 
spacing  exceeds  the  resolution  required  must  be  included  as  terms  of  such 
a  synthesis,  and  a  large  majority  of  them  will  have  general  phases.  Their 
amplitudes,  together  with  the  corresponding  amplitudes  from  a  crystal  con- 
taining a  heavy  atom  in  a  known  position  in  the  unit  cell,  are  measured  by 
the  usual  techniques.  Now  if  we  wished  to  predict  the  change  in  the  ampli- 
tude of  a  reflexion  of  general  phase  which  would  be  caused  by  a  heavy 
atom  at  a  particular  site  in  the  unit  cell,  it  would  be  necessary  lo  treat  the 
amplitude  due  to  the  protein  and  that  due  to  the  heavy  atom  as  vectors, 
and  calculate  iheir  resultant  by  means  of  a  vector  diagram.  Thus  Fig.  5  {a) 
shows  how  the  protein  reflexion  (with  amplitude  |Fp|  and  phase  <f>^  and 
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a  heavy  atom  contribution  |f^|  (with  phase  ^^)  combine  to  give  a  reflexion 
of  amphtude  |Fp^|  and  phase  ^p^  in  the  compound  crystal.  In  practice 
we  are  presented  with  the  inverse  problem  :  we  are  given  |  Fp  ]  and  |  Fp^  | 
(the  square  roots  of  the  measured  intensities)  and  |  f^  |  and  ^^  (obtained  by 
straightforward  computation  from  the  known  position  of  the  heavy  atoms 
in  the  unit  cell).  We  proceed  as  shown  in  Fig.  5  (b),  drawing  two  concen- 
tric circles  whose  radii  are  [Fpl  and  |Fp^l  respectively,  and  marking  off" 
intercepts  equal  in  length  and  direction  to  the  heavy-atom  contribution  f^. 


(a) 


(b) 


(c) 


Fig.  5.  The  vector  combination  of  protein  and  heavy-atom  contributions  to  a  reflexion; 
for  explanation  see  text. 

It  is  evident  that  there  are  in  general  two  solutions  which  attribute  diff'erent 
values  <f)p  and  ^p'  to  the  phase  of  the  protein  reflexion.  This  ambiguity 
cannot  be  resolved  without  further  information,  but  as  was  shown  by 
Bokhoven,  Schoone  and  Bijvoet^  in  a  different  connexion,  a  decision  can 
be  reached  if  two  isomorphous  replacements  can  be  made  available  in  the 
same  unit  cell.  A  pair  of  vector  diagrams  can  now  be  drawn  for  every  re- 
flexion, one  for  each  of  the  isomorphous  replacements,  and  each  gives  two 
alternative  values  of  the  protein  phase  angle.  If  all  is  well  one  value  of  the 
angle  from  the  first  isomorphous  derivative  should  be  identical  with  one 
value  from  the  second  derivative,  and  this  must  be  the  correct  phase  angle 
(see  Figs.  5  (b)  and  (c)).  Proceeding  in  this  way  one  can,  theoretically,  deter- 
mine the  phase  angles  of  all  the  reflexions  in  the  diff'raction  pattern  and  then 
compute  a  three-dimensional  Fourier  synthesis  which  should  be  a  correct 
representation  of  the  electron  density  throughout  the  unit  cell. 

It  is  for  these  reasons  that  we  have  thought  it  worth  while  to  develop  a 
number  of  different  methods  of  attaching  heavy  groups  to  various  sites  on 
the  myoglobin  molecule,  even  though  for  two-dimensional  work  a  single 
isomorphous  derivative  is  sufficient  to  establish  nearly  all  the  signs  of  the 
real  reflexions.  Among  the  complexes  illustrated  in  Fig.  4,  we  have  found 
that  those  with  Hgl42-,  with  AUCI4-,  with  PCMBS,  and  with  Hg(NH3)2+, 
are  satisfactory  for  general  phase  determination.  In  addition  we  have  been 
able  to  prepare  crystals  containing  two  or  even  three  different  heavy  groups 
simultaneously,  e.g.  Au  and  PCMBS,  Au  and  Hg(NH3)2+,  PCMBS  and 
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Hg(NH3)2+  and  finally  the  triple  complex  containing  Au,  PCMBS  and 
Hg(NH3)2+  (see  for  example  Fig.  6).  Theoretically  two  separate  replace- 
ments would  suffice,  but  in  practice  it  is  a  great  advantage  to  use  three  or 
four,  since  experimental  errors  are  considerable,  and  for  any  particular  re- 
flexion the  contributions  of  one  or  more  of  the  heavy  atoms  may  happen 
to  be  so  small  as  to  preclude  their  being  used  for  phase  determination. 


Fig.  6.  Difference-Fourier  projection  of  double  heavy-atom  derivative  of  Type  A  myo- 
globin along  y,  containing  one  mole  each  of  PCMBS  and  Hg(NH3)2  +  per  mole  of  protein. 

In  addition,  it  is  a  matter  of  some  crystallographic  interest  that  with  certain  of 
the  complexes  a  failure  of  Friedel's  Law  is  observed,  in  other  words,  it  is  found 
that  with  these  crystals  the  intensities  of  two  so-called  'equivalent  reflexions', 
020  and  020  for  example,  are  different.  It  can  be  shown  that  this  effect  will  be  pro- 
nounced, if  the  wavelength  of  the  X-rays  used  is  in  the  neighbourhood  of  that  at 
which  one  or  more  of  the  atoms  of  the  structure  have  an  X-ray  absorption  band. 
Pepinsky  and  Okaya^  have  shown  how  it  can  be  used  to  resolve  the  ambiguity  of 
phase  determination  with  a  single  isomorphous  replacement;  in  protein  crystals 
the  anomalous  effect  is  not  large  enough  for  this  to  be  done,  but  it  can  be  a  valu- 
able confirmation  of  the  phase  determined  by  other  methods.  This  anomalous 
dispersion  effect  has  also  been  used  by  Blow  in  his  work  on  the  x  projection  of 
haemoglobin. 

In  our  phase  determinations  with  myoglobin  we  have  so  far  confined 
our  attention  to  reflexions  of  spacing  greater  than  6  Â.,  the  intention  being 
to  compute  in  the  first  instance  a  three-dimensional  synthesis  with  this  re- 
solution. The  number  of  reflexions  involved  is  rather  more  than  400,  of 
which  about  one-quarter  are  real,  with  signs  already  determined  in  the 
two-dimensional  work.  We  are  at  the  present  time  calculating  the  phases 
of  the  remaining  300  reflexions.  This  task  is  not  yet  completed,  but  as  a 
first  instalment  of  the  results,  a  Fourier  projection  along  the  z  axis  of  the 
crystal  is  shown  in  Fig.  7  ;  this  is  computed  from  50  terms  of  which  40  have 
general  phase.  This  projection  is,  of  course,  subject  to  the  same  confusions 
due  to  overlap  as  is  the  y  projection  illustrated  in  Fig.  2;  it  has  been  repro- 
duced mainly  to  demonstrate  the  practicabihty  of  general  phase  determina- 
tion in  a  protein  crystal.  An  analogous  projection  of  haemoglobin  has 
recently  been  calculated  by  Blow. 
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Experience  gained  in  preparing  tiiis  projection  suggests  tliat  we  shall  in 
most  cases  be  able  to  determine  phase  angles  within  ±15-20°,  We  think 
there  is  a  good  chance  that  a  6  Â.  three-dimensional  Fourier  synthesis  cal- 
culated using  such  phases  will  reveal  some  recognizable  features  of  the 
structure.  In  any  case  such  a  synthesis  is  but  a  step  towards  the  goal  of 
resolving  individual  atoms.  We  see  no  reason  why  the  methods  now  being 

a    sin  /3 
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Fig.  7.  Fourier  projection  of  Type  A  myoglobin  crystals  along  z,  including  all  terms 
of  spacing  greater  than  6  Â 

used,  or  closely  related  ones,  cannot  be  used  at  higher  resolutions,  although 
the  amount  of  work  involved  will  in  any  case  be  considerable.  Indeed,  to 
resolve  individual  atoms  would  mean  measuring  at  least  all  the  reflexions 
with  spacings  exceeding  1-5  Â.,  that  is  to  say  25,000  reflexions  per  crystal. 
It  would  be  premature  to  guess  how  far  it  may  prove  possible  to  carry  the 
analysis,  although  it  seems  certain  that  some  extension  beyond  a  resolution 
of  6  Â.  will  be  practicable. 


REFERENCES 

1.  D.  W.  GREEN,   V.   M.  INGRAM  and  M.  F.    PERUTZ,  Proc.  Roy.  Soc,   A  225,   287 

(1954). 

2.  L.  BRAGG  and  m.  f.  PERUTZ,  Proc.  Roy.  Soc,  A  225,  315  (1954). 

3.  D.  w.  GREEN  and  A.  c.  T.  NORTH.  Unpublished  data. 

4.  G.  BODO,  H.  M.  DiNTZis  and  J.  c.  KENDREW.  Unpublished  data. 

5.  H.  scouLOUDi  and  j.  c.  kendrew.  Unpublished  data. 

6.  D.  MARKER,  in  Biological  and  Medical  Physics,  Vol.  IV,  Ed.  J.  H.  Lawrence  and 

C.  A.  Tobias,  Academic  Press,  New  York  1956. 

7.  c.  H.  CARLISLE  and  J.  D.  BERNAL.  Unpublished  data. 

8.  c.  BOKHOVEN,  J.  c.  SCHOONE  and  J.  M.  BIJVOET,  Acta  Cryst.,  4,  275  (1951). 

9.  R.  PEPiNSKY  and  y.  okaya,  Proc.  nat.  Acad.  Sei.,  Wash.,  42,  286  (1956). 


X-ray  analysis  of  haemoglobin 

M.  F.  PERUTZ 

Medical  Research  Council  Unit  for  Molecular  Biology,  Cavendish  Laboratory, 
University  of  Cambridge 


When  Pauling  and  Corey  first  noted  the  correspondence  between  the  three- 
dimensional  vector  structure  of  haemoglobin  and  the  vector  distribution  to 
be  expected  from  the  a-helix/  and  I  subsequently  found  a  weak,  but  detect- 
able, 1-5  Â  reflexion  in  haemoglobin, ^  I  beheved  that  a  solution  of  the 
haemoglobin  structure  in  terms  of  parallel  polypeptide  chains  might  be  found 
by  trial.  This  led  to  an  attempt  by  Bragg,  Howells  and  myself^  to  calculate 
the  electron  density  distribution  viewed  along  the  suspected  chain  direc- 
tion of  the  molecule,  which  coincided  with  the  a  axis  of  the  haemoglobin 
crystal.  A  Fourier  projection  on  the  a  plane,  corresponding  to  an  end-on 
view  of  a-helices,  was  indeed  obtained,  but  its  validity  rested  on  the  assump- 
tion that  the  interpretation  of  the  structure  in  terms  of  parallel  a-helices 
was  correct.  When  the  absolute  heights  of  the  electron  density  peaks  were 
measured,  they  were  found  to  amount  to  only  one-third  to  a  half  of  the 
heights  to  be  expected  from  the  initial  assumptions,  which  showed  that  the 
model  was  oversimplified.  Crick  arrived  at  the  same  conclusion  by  a  dif- 
ferent argument,*  and  also  pointed  out  that  a-helices  are  likely  to  pack 
together  at  an  angle  of  about  20°,  which  speaks  against  their  being  parallel 
in  haemoglobin.^ 

The  failure  of  this  attempt  at  analysis  by  trial  pointed  to  the  need  for 
a  different  approach,  and  one  that  was  free  from  any  preconceived  notion 
about  the  structure  of  the  molecule.  In  practice  this  meant  that  the  phases 
of  the  diff"racted  rays  would  have  to  be  determined  by  direct  methods.  In 
a  series  of  papers  Bragg,  myself  and  our  collaborators  have  now  described 
four  different  methods  of  phase  determination.  The  most  useful  of  these 
was  based  on  the  method  of  isomorphous  substitution  described  by  Dr 
Kendrew  (p.  125).  Green,  Ingram  and  Perutz^  crystallized  compounds  of 
horse  haemoglobin  in  which  the  sulphydryl  groups  were  blocked  by  /j-chloro- 
mercuribenzoate  or  by  silver  ions.  These  compounds  proved  to  be  iso- 
morphous with  normal  horse  methaemoglobin.  From  the  changes  in  the 
intensities  of  the  X-ray  diff'raction  pattern  produced  by  the  presence  of  the 
heavy  metals.  Green,  Ingram  and  Perutz'  were  able  to  determine  the  signs 
of  87  out  of  the  94  hOl  reflexions  covered  by  their  X-ray  photograph.  From 
this  and  other,  supplementary,  information  Bragg  and  Perutz  then  calculated 
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the  first  maps  of  the  electron  density  distribution  in  the  haemoglobin 
molecule  in  projection  along  the  dyad,  or  b  axis.'  Two  maps  were  calcu- 
lated; one  showing  a  projected  picture  of  the  volume  of  hydrated  protein 
into  which  salt  cannot  penetrate,  from  which  the  external  shape  of  the 
molecule  could  be  deduced.  The  other  map  showed  a  projected  picture  of 
the  electron  density  distribution  within  a  row  of  haemoglobin  molecules 
suspended  in  water.  Both  projections  were  along  the  b  axis  of  the  crystal, 
i.e.  at  right  angles  to  a,  the  suspected  chain  direction  along  which  Bragg, 
Howells  and  myself  had  attempted  to  calculate  the  first  projected  picture 
of  the  molecule.  The  b  projection  is  centro-symmetric  so  that  only  the  signs 
of  the  X-ray  reflexions  have  to  be  determined,  whereas  the  a  projection  is 
not  and  the  phase  angle  may  have  any  value. 

The  maps  give  the  external  shape  of  the  molecule  as  an  ellipsoid  of 
55  X  55  X  70  Â,  with  a  dimple  at  the  centre,  or  possibly  two  dimples  at  the 
centre  on  either  side  of  the  molecule.  They  also  show  that  the  four  free 
sulphydryl  groups  of  native  haemoglobin^  are  arranged  in  two  close  pairs 
spaced  30  Â  apart  and  related  by  the  dyad  axis  of  symmetry.  Unpublished 
work  on  two  projections  of  ox  haemoglobin  since  done  by  D.  W.  Green  and 
A.  C.  T.  North  suggests  that  these  SH-groups  are  situated  at  about  7  Â 
from  the  surface  of  the  molecule.  Apart  from  the  sulphydryl  groups,  which 
can  be  located  by  the  heavy  atoms  attached  to  them,  the  projection  shows 
a  system  of  peaks  and  depressions  which  has  so  far  defied  interpretation 
and  is  far  too  complex  to  fit  into  any  simple  structural  theory.  One  obstacle 
to  interpretation  is  lack  of  resolution  ;  this  is  so  low  that  single  atoms  are 
smeared  out  to  a  diameter  of  9  Â.  The  main  obstacle,  however,  is  the  great 
thickness  of  matter,  corresponding  to  about  30  atoms,  which  is  projected 
on  to  one  plane,  thus  obscuring  the  individual  parts  of  the  molecule  by 
superposition. 

At  the  next  stage  of  the  analysis  Ann  F.  Cullis,  H.  M.  Dintzis  and  myself 
have  now  tried  both  to  improve  the  resolution  of  the  projection  along  the 
b  axis  and  to  prepare  some  more  heavy  atom  complexes  in  preparation  for 
a  three-dimensional  analysis  of  the  structure.  At  the  same  time  D.  M.  Blow 
has  made  a  renewed  attempt  to  solve  the  projection  along  the  a  axis,  this 
time  making  no  assumptions  about  the  structure,  but  relying  entirely  on 
isomorphous  substitution  to  determine  the  phase  angles. 

An  additional  pair  of  sites  combining  with  heavy  atoms  was  found  by 
blocking  the  sulphydryl  groups  with  iodoacetamide  and  then  allowing  the 
haemoglobin  to  react  with  two  equivalents  of  mercuridiacetate.  The  posi- 
tions of  these  new  mercury-combining  sites  in  relation  to  the  sulphydryl 
groups  are  shown  in  Fig.  1 .  Their  chemical  character  is  still  unknown.  For 
calculating  the  improved  Fourier  projection  along  the  b  axis  we  determined 
the  signs  of  420  reflexions  instead  of  the  94  used  in  the  first  round,  and 
thus  attained  a  resolution  of  2-8  Â  instead  of  the  original  6-5.  The  new  map, 
shown  in  Fig.  2,  contains  far  more  detail  than  the  earlier  one,  but  still  fails 
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Fig.  1 .  Positions  of  mercury  atoms  in  different  haemoglobin  derivatives.  Tlie  /7-chloro- 
mercuribenzoate  and  dimercuriacetate  groups  are  attached  to  the  sulphydryl  groups.  The 
mercuri  di-acetate  groups  are  attached  to  a  site  whose  chemical  character  is  not  known. 
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Fig.  2.  Electron  density  distribution  in  the  haemoglobin  molecule  seen  in  projection 
along  the  dyad  axis  at  a  resolution  of  2-8  Â.  The  black  dots  represent  sulphydryl  groups. 
Contours  are  drawn  at  intervals  of  2-5  electrons/A^  above  the  level  of  the  suspension 
medium,  which  is  2m  ammonium  sulphate  solution.  For  the  sake  of  clarity  the  three 
lowest  contours  have  been  omitted,  and  only  the  peaks  are  shown.  The  contours  near 
the  periphery  of  the  molecule  are  not  exact,  being  partly  modified  by  the  overlapping  of 
neighbouring  molecules  in  the  crystal  lattice. 
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to  exhibit  any  features  which  could  be  identified  with  either  polypeptide 
chains  or  haem  groups. 

Clearly  it  is  still  the  great  thickness  of  matter  projected  on  one  plane 
which  defeats  us,  and  it  looks  as  though  only  a  three-dimensional  analysis, 
giving  the  electron  density  distribution  along  a  series  of  parallel  sections 
through  the  molecule,  will  finally  reveal  its  structure.  This  will  have  to  be 
the  next  step. 

Dr  Kendrew  has  already  explained  to  you  that  this  involves  determina- 
tion of  the  phase  angles  of  each  one  of  many  thousands  of  X-ray  reflexions, 
which  is  far  more  diflScult  than  the  determination  of  signs  in  the  projection 
along  the  two-fold  axis  to  which  our  work  had  been  confined  so  far.  It 
seemed  especially  doubtful  if  the  phase  angles  could  be  measured  with  suffi- 
cient accuracy  in  a  molecule  as  big  as  haemoglobin,  where  the  number  of 
sites  combining  with  heavy  atoms  per  unit  molecular  weight  of  protein  is 
only  half  as  great  as  in  myoglobin.  The  projection  along  the  a  axis,  which 
lacks  a  centre  of  symmetry,  was  therefore  chosen  as  a  test  case  for  a  later 
three-dimensional  analysis.  Its  successful  solution  by  D.  M.  Blow  is  most 
encouraging,  though  the  accuracy  of  the  phase  angles  so  far  obtained  could 
be  much  improved  if  one  or  two  more  sites  combining  with  heavy  atoms 
could  be  discovered. 

In  conclusion  I  should  like  to  mention  one  interesting  piece  of  informa- 
tion about  the  structure  of  the  haemoglobin  molecule  obtained  by  D.  J.  E. 
Ingram,  J.  F.  Gibson  and  myself.^  We  determined  the  orientation  of  the  four 
haem  groups  in  the  haemoglobin  molecule  by  electron  spin  resonance,  using 
a  single  crystal  of  horse  haemoglobin.  We  found  the  haem  groups  to  be 
arranged  in  two  pairs,  related  by  the  dyad  axis  of  symmetry.  The  normals 
to  one  pair  of  haem  groups  lie  in  the  crystallographic  a,  b  plane  at  an  angle 
of  32°  on  either  side  of  the  a  axis.  The  normals  to  the  other  pair  have  a 
similar  orientation,  except  that  they  are  tilted  by  13°  above  and  below  the 
a,  b  plane.  Knowing  the  orientation  of  the  haem  groups  tells  us  nothing  as 
yet  about  their  positions,  but  it  will  make  it  much  easier  to  find  them  by 
X-ray  analysis  later  on. 
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Further  study  on  myoglobin* 

I.  Heterogeneity  of  human  myoglobin 

Some  properties  of  Mb  I  and  Mb  II 
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Institute  of  Biological  Chemistry,  University  of  Rome 


In  connection  with  the  problem  of  heterogeneity  of  crystalHne  proteins,  we 
should  like  to  refer  to  our  work  on  human  crystalline  myoglobin  (Mb).  We 
have  demonstrated  that  at  least  three  components  (Mb  I,  Mb  II,  Mb  III),  dif- 
fering from  one  another  in  their  electrophoretic  behaviour,  are  present  in 
preparations  of  human  crystalhne  Mb.^  The  relative  percentages  of  these 
components  are  as  follows:  Mb  I,  74%;  Mb  II,  19%;  Mb  III,  7%.  It  has 
been  possible,  by  means  of  preparative  zone  electrophoresis  (on  paper  and 
on  starch  geP),  to  isolate  Mb  I  and  Mb  II  and  to  study  some  of  their  pro- 
perties. 

As  shown  in  Fig.  1,  the  isolated  components  proved  to  be  homogeneous 
when  subjected  again  to  paper  electrophoresis.  Spectrophotometric  analysis 
of  the  various  derivatives  of  Mb  I  and  Mb  II  (Mb,  MbOg,  MbCO,  Mb+) 
showed  that  the  two  components  have  absorption  curves  almost  identical 
with  each  other  both  in  the  visible  and  Soret  region  and  which  correspond 
with  those  obtained  for  unfractionated  crystalline  human  Mb. 

A  detailed  study  of  the  reversible  combination  with  02^  revealed  that 
Mb  I  and  Mb  II  are  almost  identical  as  regards  oxygen  equilibrium,  either 
with  respect  to  the  shape  of  the  dissociation  curve  (Figs.  2  and  3)  and  O2 
affinity  (Fig.  4)  at  different  temperatures.  These  results  seem  interesting, 
particularly  as  far  as  the  minor  component  (Mb  II)  is  concerned;  this 
pigment  has  all  the  properties  of  a  true  myoglobin:  absorption  spectra, 
reversible  combination  with  O2,  shape  of  the  dissociation  curve  and  affinity 
for  O2,  so  that  we  must  definitely  give  up  the  idea  that  we  are  dealing  with 
an  impurity  or  an  artefact. 

We  must  reject  also  the  hypothesis  that  Mb  II  is  a  residue  of  foetal  Mb, 
since  experiments  we  have  carried  out  demonstrate  that  foetal  Mb  cannot 
be  separated  by  electrophoresis  from  adult  Mb,  and  that  preparations  of 
foetal  Mb  show  the  presence  of  several  components  which  have  electro- 
phoretic characteristics  similar  to  those  found  in  adult  Mb. 
*  Aided  by  a  grant  from  the  Rockefeller  Foundation. 
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Fig.  1.  Paper  electrophoresis  of  homogeneous  Mb  I,^  Mb  II,^  and  non-homogeneous 
crystalline  myoglobin.  2 


l.On 


Fig.  2.  Oxygen  dissociation  curve   of  human 
Mb  I  (5  X  10-5  M),  at  20°  C.  and  pH  8;  Y= 

Mb^M^^bO.'  P^=°-^^  "^  "^-  ^^  '^^  ^^^^ 
denote  concentrations  P^  oxygen  pressure  (at  20°) 
for  50%  oxygen  saturation. 


pO^     "tin  H  g 


5  6  7 

p02      mm  Hg 

Fig.  3.  Oxygen  dissociation  curve  of  human  Mb  II  (4  x  10 -^M),  at  20°  C  and  pH  8; 
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Fig.  4.  Influence  of  temperature  on  oxygen  affinity  of  human  Mb  I  9  and  Mb  II  o; 

pH  8,  borate  buffer  1=005  .  K=— ^ %-.  Mb  I  JH=-13-5  kcal;  Mb  II  JH=-13-3 

Mb  X  PO2 

kcal  (calculated  from  the  slopes  of  the  lines.) 
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From  a  practical  point  of  view,  the  above  results  lead  to  some  limitations 
in  the  use  of  the  crystalline  pigment  in  future  research  on  myoglobin.  Studies, 
also  with  spectrophotometric  methods,  on  the  equilibrium  between  human 
Mb  and  O2,  can  be  properly  based  on  non-homogeneous  crystalline  pigment, 
whilst  in  any  research  on  the  structure  of  myoglobin  homogeneous  prepara- 
tions must  be  used. 

Research  still  in  progress  in  our  Laboratory  leads  us  to  think  that  human 
Mb  I  and  Mb  II  have  different  chemical  structures. 
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IL  Chemical  and  Biochemical  properties  of  a 
new  type  of  myoglobin  in  molluscs 

A.  ROSSI-FANELLI,  E.  ANTONINI, 
D.  POVOLEDO 

Institute  of  Biological  Chemistry,  University  of  Rome 
and  Zoological  Station,  Naples 


The  object  of  this  paper  is  the  study  of  a  new  type  of  myoglobin  (Mb) 
isolated  and  crystallized  from  the  red  buccal  muscles  of  two  common  Medi- 
terranean moUusks:  Aplysia  Depilans  and  Aplysia  Limacina*  The  purifica- 
tion of  this  pigment  has  been  previously  described. ^-^  The  homogeneity  of 
crystallized  Aplysia  Mb  was  demonstrated  by  paper  and  boundary  electro- 
phoresis and  by  the  ultracentrifuge  (Fig.  1).  The  isoelectric  point  (deter- 
mined by  paper  electrophoresis  owing  to  the  small  quantity  of  material 
available)  was  found  to  be  near  pH  4-5  in  acetate  buffer  1=0-05, 

The  sedimentation  constant  and  the  diffusion  rate  of  the  pigment  were 
determined  by  using  the  Spinco  Ultracentrifuge  and  the  Perkin  Elmer  elec- 
trophoresis apparatus. ^-^  The  sedimentation  constant  S°2o  was  found  to 
be  2-06  xlO~^^;  this  is  very  similar  to  the  figures  obtained  for  myoglobin 
of  diff"erent  animal  species  by  other  authors. ^-^ 

From  these  data  it  was  possible  to  establish  the  molecular  weight  as 
approximately  20,000;  the  minimum  molecular  weight  deduced  from  the 
haem  content  (0-30%  iron)  is  18,000.  These  values  are  comparable  with 
those  recently  found  for  pure  and  homogeneous  horse  Mb  by  Theorell  and 
Âkeson.^  The  accurate  spectrophotometric  analysis  of  some  derivatives  of 
Aplysia  Mb  (Mb+,  MbOH,  MbOa,  MbCO,  Mb)  showed  that  both  visible 
and  ultra-violet  spectra  of  this  pigment  differ  widely  from  those  of  mam- 
malian Mb  (see  Table  2). 

The  reversible  combination  with  O2  and  CO  was  studied  with  the  spectro- 
photometric method  described  elsewhere.'  It  can  be  seen  from  Fig.  2  that 
the  O2  dissociation  curve  of  Aplysia  Mb,  like  the  mammalian  Mb,  is  hyperbolic, 
but  the  oxygen  affinity  is  lower:  P  *  at  20°  =  2-7  mm.  Hg.  The  Bohr  effect 
is  almost  absent  (d  log  P  |)/(d  pH)^=0-02).  The  overall  heat  for  the  reac- 
tion Mb+02->Mb02  calculated  from  the  van't  Hoff  equation  between 
10°  and  30°  is  -13-6  kcal. 

*  All  the  properties  analysed  showed  no  significant  difference  between  Mb  obtained 
from  Aplysia  Depilans  and  Aplysia  Limacina. 
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The  values  of  K  for  the  reaction  MbOo+CO^MbCO+Og  was  found 

to  be  106  at  20"  and  pH  7-4  and  that  of  y  (y=*-^^)  0-028.  This  figure 

is  remarkably  different  from  that  obtained  by  Theorell  for  horse  Mb^  and 
by  us  for  human  Mb  (y  =  0-048). 

The  difference  in  the  properties  of  Aplysia  and  mammalian  Mb  could 
arise  from  a  substantial  difference  in  the  chemical  structure  of  the  two 
proteins.  The  prosthetic  group  of  Aplysia  Mb  is  identical  with  that  of  horse 


mm  Hg 


log  pO^ 


Fig.  2.  Oxygen  dissociation  curve  of  Aplysia  Mb  (~3  x  10~%)  at  20°  C  and  pH  7-2  Y= 
MbOa 


Mb+MbOa 


Mb:  this  was  demonstrated  by  spectrophotometry  of  pyridine  haemochro- 
mogen  and  by  resynthesis  of  each  pigment  from  its  own  globin  and  the 
haem  split  from  the  other.  We  then  analysed  the  amino  acid  composition 
of  crystallized  Mb  from  Aplysia  Depilans  and  Aplysia  Limacina,  by  ion 
exchange  chromatography  according  to  Moore  and  Stein. ^  The  results  ob- 
tained are  reported  in  Table  1,  which  includes  for  comparison  the  amino 
acid  composition  of  a  mammalian  (human)  Mb.^''  Table  1  shows  a  notice- 
able difference  in  the  amino  acid  content  between  Aplysia  and  human  Mb. 
The  former  is  richer  in  aspartic  acid,  serine,  valine,  phenyl  alanine,  arginine, 
and  poorer  in  threonine,  glutamic  acid,  proHne,  tyrosine,  hystidine  and 
lysine.  Aplysia  Mb  is  characterized  by  a  higher  content  of  dicarboxylic 
amino  acids  and  by  a  small  content  of  basic  amino  acids.  Histidine  is  pre- 
sent in  such  a  quantity  as  to  account  for  only  one  mole  per  mole  of  protein. 
This  result  seems  very  interesting  in  connection  with  the  role  played  by 
imidazole  as  haem-linked  group. 
Kps 
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The  most  important  chemical  and  physico-chemical  properties  of  Aplysia 
Mb  compared  with  mammalian  Mb,  are  summarized  in  Table  2.  The  data 
show  that  the  pigment  isolated  from  Aplysia  is  very  similar  to  mammalian 
myoglobins  with  respect  to  molecular  weight,  haem  content,  the  shape  of 
the  oxygen  dissociation  curve  and  the  absence  of  the  Bohr  effect.  On  the 
other  hand,  Aplysia  Mb  shows  some  characteristic  features,  namely  absorp- 
tion spectra,  'span',  relative  affinity  for  CO  and  O2,  isoelectric  point  and 
chemical  nature  of  globin,  that  strongly  differentiate  this  pigment  from  all 
the  known  myoglobins. 

Table  1 

AMINO    ACID    COMPOSITION    OF    APLYSIA 

DEPILANS    MYOGLOBIN   AND    HUMAN 

MYOGLOBIN 

(Average  values  of  three  determinations) 
Grams  amino  acid  per  100  g.  of  protein 


Aspartic  acid 

Aplysia  Mb 

Human  Mb 

12-71 

8-27 

Threonine 

1-70 

2-85 

Serine 

8-29 

4-43 

Glutamic  acid 

9-58 

16-17 

Proline 

0-98 

5-40 

Glycine 

4-60 

6-08 

Alanine 

11-66 

5-82 

Cystine 

0-00 

000 

Valine 

600 

4-64 

Methionine 

1-88 

2-69 

Isoleucine 

4-89 

5-27 

Leucine 

8-73 

13-67 

Tyrosine 

1-48 

2-19 

Phenylalanine 

11-75 

8-22 

Histidine 

1-22 

7-79 

Lysine 

6-89 

19-09 

Arginine 

4-57 

2-47 

9] 
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CHEMICAL    AND    PHYSICO-CHEMICAL    CONSTANTS    OF    APLYSIA 
MYOGLOBIN    AND    MAMMALIAN    MYOGLOBIN 


Aplysia  Mb 

Mammalian  Mb 

Prosthetic  group 

(Depilans  and  Limacina) 

(Horse) 

Ferroprotoporphyrin  IX 

Ferroprotoporphyrin  IX 

Molecular  weight 

18,000-20,000 

17,000-19,000 

S  20 

2-06x10-13 

1-97x10-13 

Haem/mole 

1 

1 

Shape  of  the  oxygen  dissocia- 

tion curve 

hyperbolic 

hyperbolic 

P  ^  (20°  pH  7-4) 

2-7  mm  Hg 

0-7  mm  Hg 

Bohr  effect 

absent 

absent 

'Span' 

72  Â 

30  Â 

[MbCOjxpOa 

106 

28 

[Mb02]xpC0 

logK 

y= 

0  028 

0-045 

span 

Isoelectric  point 

~4-5 

6-78 

Chemical  nature  of  globin 

acidic 

basic 

Auto-oxidation 

fairly  rapid 

fairly  rapid 

Absorption  spectra* 

Mb+        max  (mju) 

400 

505        640 

408    505        630 

e  X  10-3 

99 

131        3-8 

160      10-2        4-2 

MbOH     max  (m/x) 

412 

543        580       600 

414    540       580 

€XlO-3 

91 

9           8-7        8-7 

90        9-1         8-6 

MbCO     max  (m/x) 

424 

541        571 

423     540       578 

ex  10-3 

176 

14-2      13-7 

178       14         11-8 

Mb          max  (m^) 

438 

555 

434                 555 

ex  10-3 

113 

13 

113                   13-3 

Mb02      max  (m/x) 

416 

542       578 

ex  10-3 

108 

13-2       13-3 

*  Data  for  horse  Mb  obtained  by  the  Authors  in  the  present  work. 
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The  chemical  effects  of  gene  mutations  in  some 
abnormal  human  haemoglobins 

J.  A.  HUNT  AND  V.  M.  INGRAM 

Medical  Research  Council  Unit  for  Molecular  Biology, 
Cavendish  Laboratory,  University  of  Cambridge 


It  has  been  known  for  some  years  that  the  inherited  haemolytic  anaemia, 
known  as  sickle  cell  anaemia,  is  due  to  an  alteration  in  the  structure  of  the 
haemoglobin  molecule.^  This  change  is  thought  to  be  caused  by  a  single 
mutation  of  the  gene  responsible  for  haemoglobin  synthesis.^  We  can 
now  report  that  the  effect  of  this  mutation  is  to  change  only  one  amino 
acid  residue  in  polypeptide  chains  containing  nearly  300  residues;  one 
of  the  glutamic  acids  in  normal  haemoglobin  has  been  replaced  by 
valine. 

Earlier  experiments  indicated  that  the  two  haemoglobins  differed  electro- 
phoretically,^'^  haemoglobin  S — the  sickle  cell  anaemia  type — having  2-3 
carboxyl  groups  fewer  than  haemoglobin  A — the  normal  protein.  Deoxy- 
genated  haemoglobin  S  has  an  abnormally  low  solubility*  which  causes  the 
sickling  of  the  red  blood  cells  characteristic  of  this  anaemia.  However, 
determinations  of  amino  acid  composition,^-^  A^-terminaF-^-^  and  C- 
terminal  amino  acids^^*  or  sulphydryl  groups^^'^^  did  not  show  any  signifi- 
cant differences  between  them.  More  recently, ^^  degradation  of  the  two 
haemoglobins  with  trypsin  showed  that  the  alteration  which  produces  haemo- 
globin S  resides  in  one  very  short  section  of  the  polypeptide  chains.  Whilst 
most  of  the  fragments  in  the  two  haemoglobin  showed  similar  behaviour 
on  electrophoresis  and  chromatography,  a  pair  of  peptides,  called  No.  4, 
were  found  in  which  these  properties  differed.  The  complete  chemical  struc- 
ture of  these  two  peptides  has  now  been  determined  by  amino  acid  analyses, 
end  group  analyses,  some  Edman  stepwise  degradations  and  the  results  of 
partial  acid  hydrolyses — the  last  are  shown  in  Fig.  1  (a) — which  led  to  the 
formulation  of  the  two  sequences.^*  They  each  consist  of  the  nine  amino 
acids  shown  in  Fig.  1  (b).  It  can  be  seen  that  they  differ  in  only  one  amino  acid  ; 
one  of  the  glutamic  acids  of  the  normal  No.  4  peptide  is  replaced  by  valine 
in  the  sickle  cell  No.  4  peptide.  To  date  further  investigation  of  the  other 
peptides  which  had  been  obtained  by  trypsin  digestion,  has  failed  to  reveal 
any  additional  differences.  The  trypsin  resistant  cores  of  the  two  haemo- 
globin molecules — about  30%  of  the  molecule — have  been  digested  with 
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F/g.  1  (a).  Acid  Degradation  of  the  No.  4  Peptides  from  Haemoglobins  A  and  S. 
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Fig.  1  {b).  Structure  of  the  No.  4  Peptides  from  Haemoglobins  A,  S  and  C. 
(H  =  His=histidine,  V= Val  =  valine,  L= Leu = leucine,  T=Thr= threonine,  P=Pro= 
proline,  G  =  Glu = glutamic  acid,  Ly  =  Lys = lysine.) 
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chymotrypsin  and  subjected  to  examination  by  fingerprinting^^  and  chromato- 
graphy ;  no  differences  could  be  detected. 

This  finding  is  in  agreement  with  such  differences  between  the  two  haemo- 
globins as  had  already  been  established,  when  it  is  remembered  that  haemoglo- 
bin (mol.  wt.  66,700)  has  two  identical  half-molecules,^^  and  that  therefore 
the  No.  4  peptide  occurs  twice  in  the  whole  molecule.  It  still  remains  to 
relate  this  change  of  amino  acids  to  the  solubility  differences  and  to  the 
causation  of  the  haemolytic  anaemia.  However,  it  is  the  kind  of  change 
which  one  would  expect. 

Human  haemoglobin  S  is  a  clear  case  where  the  mutation  of  a  nuclear 
gene  is  inherited  in  a  strictly  Mendelian  manner.  Furthermore,  it  is  well 
established  that  this  mutation  alters  the  nature  of  the  protein  haemoglobin. 
We  can  now  show  for  the  first  time  that  the  effect  of  such  a  mutation  is  a 
chemical  one  and  that  it  leads  in  this  case  at  least  to  a  substitution  of  only 
one  amino  acid  in  the  protein  by  another  one.  One  may  speculate  that  this 
very  small  change  in  the  polypeptide  chain  of  the  protein  reflects  a  very 
small  change  in  the  DNA  chain  of  the  gene,  affecting  perhaps  not  more 
than  one  of  the  nucleotide  base  pairs  of  that  chain. 

Another  mutation  of  the  same  haemoglobin  gene  produces  another  ab- 
normal protein,  haemoglobin  C.^^-^'  This  causes  a  severe  anaemia  only 
when  it  occurs  together  with  haemoglobin  S.  The  solubility  of  haemoglobin 
C  is  almost  normal, ^^-^^  and  so  is  its  amino  acid  composition;^^  however, 
it  is  easily  distinguished  electrophoretically,  since  it  has  even  fewer  net 
negative  charges  per  molecule  than  haemoglobin  S  which  itself  has  two 
fewer  than  the  haemoglobin  A  molecule. 

Haemoglobin  C  has  been  degraded  with  trypsin  and,  as  a  second  stage, 
with  chymotrypsin,  using  the  methods  described  for  haemoglobin  S.  The 
mixtures  of  peptides  obtained  were  compared  with  those  from  haemoglobin 
A  by  the  'fingerprinting'^^  and  other  methods  employed  for  haemoglobin  S. 
Again,  it  was  found  that  all  peptides  behaved  identically  except  the  No.  4 
peptide  in  tryptic  haemoglobin  A  digests  whose  glutamic  acid  had  changed 
to  valine  in  haemoglobin  S.  This  same  peptide  was  not  found  in  tryptic 
digests  of  haemoglobin  C.  Its  place  was  taken  by  two  new  peptide  spots, 
one  neutral  and  the  other  positively  changed  at  pH  6-4.^^  Using  the  methods 
referred  to  above,  the  amino  acid  sequence  of  these  two  peptides  has  been 
determined  and  is  shown  in  Fig.  1  (b).  Clearly  the  same  amino  acid,  glu- 
tamic acid,  of  normal  haemoglobin  has  changed  also  in  the  haemoglobin  C 
mutation;  its  place  is  taken  by  lysine.  In  each  half  molecule  a  negatively 
charged  side  chain  has  been  replaced  by  a  positive  group  altering  the  net 
charge  of  the  protein  by  four  units,  twice  as  many  as  for  haemoglobin  S. 
This  interpretation  agrees  very  well  with  the  known  electrophoretic  series  of 
the  haemoglobins  which  is  A — S — C.  The  additional  peptide  found  in  tryp- 
tic digests  of  haemoglobin  C  is  due  to  the  introduction  of  an  additional 
lysine  peptide  bond  which  is  hydrolysed  by  the  enzyme. 
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Genetic  evidence  shows  that  the  haemoglobin  S  and  the  haemoglobin  C 
mutations  may  be  allelic,^'  i.e.  that  they  occur  in  the  same  locus.  We  can  now 
give  a  chemical  interpretation  of  this  statement,  not  in  terms  of  the  chemical 
constitution  of  the  gene  but  of  the  amino  acid  sequence  of  the  polypeptide 
chain  whose  synthesis  the  gene  controls.  In  the  two  mutations  discussed  in 
this  paper  the  same  very  small  portion  of  the  haemoglobin  gene  seems  to 
be  affected  and  in  turn  the  identical  amino  acid  in  the  polypeptide  chain  is 
replaced  by  either  valine  or  lysine  in  haemoglobin  S  and  C  respectively. 
The  two  mutations  are  indeed  alleles. 
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Etude  de  quelques  proteolyses  limitées  du 
chymotrypsinogène  de  bœuf 

P.  DESNUELLE  ET   M.  ROVERY 

Laboratoire  de  Chimie  Biologique,  Faculté  des  Sciences,  Marseille,  France 


L'objet  de  ce  mémoire  est  de  décrire  certaines  expériences  effectuées  dans 
le  but  de  mieux  comprendre  les  relations  existant  entre  la  structure  chimique 
du  chymotrypsinogène  et  son  caractère  de  précurseur  d'un  enzyme  protéo- 
lytique.  Comment  cette  molécule  inactive  acquiert  elle  le  pouvoir  d'hydrolyser 
d'autres  protéines?  Nous  savons  que  le  phénomène  d'activation  est  en  fait 
déterminé  par  une  protéolyse  limitée.^ 

I.    ACTIVATION    'RAPIDE'    DU    CHYMOTRYPSINOGÈNE 

Une  protéolyse  est  dite  limitée  quand  le  nombre  des  liaisons  coupées  par 
l'enzyme  est  très  faible.  Au  lieu  d'être  comme  d'habitude  transformée  en 
peptides  relativement  courts,  cette  protéine  donne  alors  naissance  à  une  ou 
à  plusieurs  autres  protéines  qui  ne  diffèrent  de  la  première  que  par  l'ouver- 
ture de  quelques  liaisons  et  éventuellement  la  perte  de  quelques  résidus. 
Affectant  la  structure  covalente  et  fondamentale  de  la  molécule,  ces  ruptures 
doivent  certainement  provoquer  des  remaniements  profonds  de  la  structure 
secondaire.  Or,  dans  certains  cas  tout  au  moins,  les  remaniements  en  ques- 
tion n'entrainent  pas  ce  qu'on  a  coutume  d'appeler  la  'dénaturation'.  Bien 
au  contraire,  certaines  protéines  engendrées  par  protéolyse  limitée  mani- 
festent une  activité  physiologique  précise  que  la  protéine-mère  ne  possède 
pas.  C'est  d'ailleurs  pourquoi  les  proteolyses  limitées  sont  si  intéressantes 
puisqu'elles  permettent  quelquefois  d'expliquer  en  termes  chimiques  simples 
ces  activations  par  'démasquage'  dont  le  principe  et  les  résultats  sont  connus 
depuis  de  nombreuses  années.  La  protéolyse  trypsique  qui  active  le  chymo- 
trypsinogène ne  peut  pas  être  plus  limitée  puisqu'elle  consiste  en  la  rupture 
d'une  liaison  unique,  reliant  un  résidu  d'arginine  à  un  résidu  d'isoleucine 
(liaison  arginyl-isoleucine  1  de  la  fig.  1). 

Il  est  d'ailleurs  tout  à  fait  vraisemblable  que  le  mécanisme  de  l'activation 
soit  nettement  plus  compliqué  que  celui  indiqué  dans  la  fig.  1.  La  variation 
sensible  du  pouvoir  rotatoire  au  moment  de  l'activation^  signifie  sans  doute 
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que  la  structure  secondaire  de  la  molécule  subit  quelques  remaniements.  On 
peut  alors  penser  que  ces  remaniements  créent  le  centre  actif  de  l'enzyme, 
en  rapprochant  par  exemple  deux  ou  plusieurs  groupes  fonctionnels  dont 
la  position  initiale  dans  le  zymogène  n'est  pas  favorable.  Mais,  si  l'on  s'en 
tient  à  la  structure  covalente  de  la  molécule,  la  rupture  précédemment  men- 
tionnée de  la  liaison  arginyl-isoleucine  semble  être  le  seul  événement  déter- 
minant l'activation.  La  fig.  2  montre  d'ailleurs  que,  tout  au  long  de  l'acti- 
vation,  les  proportions  d'isoleucine  N-terminale  formées  par  la  rupture 
de  la  liaison  arginyl-isoleucine  sont  bien  égales  au  pourcentage  d'activation. 
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Chymotrypsinogène        Chymotrypsine-TT      Chymotrypsine -à 

Fig.  1.  Schéma  de  l'activation  'rapide'  du  chymotrypsinogène.  La  liaison  1  est  coupée 
par  la  trypsine.  La  liaison  2  s'autolyse  au  sein  de  la  chymotrypsine- tt. 


c'est-à-dire  au  nombre  de  mole  de  chymotrypsine  formées  aux  dépens  de 
100  mole  de  chymotrypsinogène. 

La  rupture  de  la  liaison  arginyl-isoleucine  convertit  le  zymogène  en  une 
première  chymotrypsine  appelée  chymotrypsine-TT,^  laquelle  est  instable  et 
se  convertit  rapidement  par  autolyse  en  une  deuxième  chymotrypsine  douée 
de  la  même  activité,  la  chymotrypsine-S.  Cette  autolyse  intervient  au  niveau 
d'une  liaison  leucyl-sérine  (liaison  2)  et  elle  provoque  la  libération  d'un 
dipeptide,  la  sérylarginine.  La  libération  de  ce  peptide  n'a  d'ailleurs  aucune 
signification  particulière.  Elle  a  lieu  parce  que  les  liaisons  1  et  2  sont  très 
proches  l'une  de  l'autre.  Si  ces  deux  liaisons  étaient  séparées  par  un  pont 
disulfure,  aucun  peptide  ne  serait  libéré.  La  molécule  posséderait  simple- 
ment une  chaîne  'ouverte'  de  plus.  Le  fait  important  est  que  l'ensemble  du 
phénomène  soit  très  simple.  Deux  enzymes  difi'érents,  la  trypsine  et  la  chymo- 
trypsine agissent  l'un  après  l'autre.  Chaque  enzyme  coupe  une  seule  liaison 
choisie,  pour  une  raison  qui  nous  échappe  encore,  parmi  toutes  celles  qui 
satisfont  aussi  bien  les  exigences  structurales  des  enzymes  en  question. 
Rappelons  ici^  que  le  chymotrypsinogène  contient  4  résidus  d'arginine  et 
12  résidus  de  lysine  en  principe  favorables  à  l'action  trypsique.  Il  contient 
aussi  4  résidus  de  tyrosine  et  6  résidus  de  tryptophane,  sans  doute  plus 
favorables  à  l'action  chymotrypsique  que  le  résidu  de  leucine  à  coté  duquel  se 
produit  l'autolyse  de  la  chymotrypsine-7r.  Il  est  donc  évident  que  les  Uaisons 
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1  et  2  sont  particulièrement  exposées.  Des  considérations  de  forme  molécu- 
laire, d'enroulement  plus  ou  moins  serré  des  chaînes  peptidiques  jouent 
certainement  un  rôle  déterminant  dans  la  spécificité  du  phénomène. 

Le  phénomène  de  l'activation  'rapide'  n'est  d'ailleurs  pas  seulement 
simple.  Il  est  aussi  en  quelque  sorte  pur  car  les  chymotrypsines-vr  et  S  peuvent 
être  obtenues  avec  des  rendements  dépassant  95%.'*  Les  chaînes  peptidiques 
sont  donc  coupées  dans  ce  cas  par  la  trypsine  et  la  chymotrypsine  un  peu 
comme  on  coupe  une  ficelle  avec  une  paire  de  ciseaux.  Dans  la  fig.  2,  la 
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Fig.  2.  Cinétique  de  l'activation  'rapide'  du  chymotrypsinogène  : 
0  :  Activité  chymotrypsique  (  %)  vis  à  vis  de  l'acétyl-L-tyrosine  éthylester. 
O  '■  Isoleucine  N-terminale  (mole  pour  1  mole  de  zymogène). 

[n  :  Sérylarginine  (mole  pour  I  mole  de  zymogène)  ou  chymotrypsine-â  (mole  pour 
1  mole  de  zymogène). 
/\  :  Chymotrypsine- 77  (mole  pour  1  mole  de  zymogène). 

courbe  donnant  les  quantités  de  sérylarginine  en  fonction  du  temps  corres- 
pond à  la  cinétique  de  la  formation  de  la  chymotrypsine-S.  Dans  cette 
même  figure,  on  a  tracé  la  cinétique  de  la  formation  de  la  chymotrypsine-7r 
en  retranchant  la  chymotrypsine-S  des  quantités  totales  de  chymotrypsine 
données  par  l'isoleucine  N-terminale.  Les  proportions  de  chymotrypsine-vr 
peuvent  devenir  beaucoup  plus  grandes  si  l'on  empêche  l'autolyse  de  cet 
enzyme  par  le  jS-phénylpropionate.^ 

Avant  de  quitter  l'activation  'rapide',  notons  un  fait  assez  singulier:  11 
est  possible  de  couper  la  liaison  leucyl-sérine  par  la  chymotrypsine  (voir 
plus  loin)  sans  toucher  à  la  liaison  arginyl-isoleucine.  Or,  la  coupure  de 
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cette  liaison  n'active  pas  la  molécule.  Elle  respecte  simplement  dans  une 
certaine  mesure  son  aptitude  à  être  ultérieurement  activée.^  L'extrême  spéci- 
ficité des  mécanismes  en  jeu  pendant  l'activation  apparaît  ainsi  clairement. 
Les  remaniements  de  structure  nécessaires  à  l'activation  se  produisent  quand 
on  ouvre  la  liaison  arginyl-isoleucine.  Mais  ils  ne  se  produisent  pas  quand 
on  ouvre  une  autre  liaison  séparée  de  la  première  par  deux  résidus  seulement. 


II.  L'ACTIVATION  'LENTE'  DU  CHYMOTRYPSINOGÈNE 

Malgré  leur  importance  théorique  considérable,  les  chymotrypsines-7r  et  8 
ne  sont  pas  encore  très  connues.  La  chymotrypsine  la  plus  classique  est 
toujours  celle  que  cristallisèrent  Kunitz  et  Northrop'  en  1935.  Pour  obtenir 
leur  enzyme,  les  auteurs  adoptèrent  d'ailleurs  des  conditions  d'activation 
bien  particulières  que  l'on  appelle  maintenant  les  conditions  de  l'activation 
'lente'.  Ce  mode  d'activation  a  pour  caractéristique  essentielle  d'utiliser  des 
quantités  très  faibles  de  trypsine.  Il  faut  donc  attendre  longtemps  (40-48  h. 
à  5°C)  avant  que  le  processus  soit  terminé.  La  chymotrypsine- a,  qui  peut 
alors  être  cristallisée  avec  un  rendement  d'environ  50%,  possède  deux  résidus 
terminaux  de  plus  que  la  chymotrypsine-S  (l'alanine  en  position  N-terminale^ 
et  la  tyrosine  en  position  C-terminale).^"  Sa  formation  exige  donc  certaines 
proteolyses  additionnelles  dont  il  est  intéressant  de  rechercher  l'origine.  Le 
tableau  1  indique  les  résidus  terminaux  existant  dans  le  chymotrypsinogène, 
la  chymotrypsine-S  et  la  chymotrypsine- a.  Chaque  protéine  mentionnée  dans 
le  tableau  1  dérive  de  la  précédente  par  un  processus  de  protéolyse  limitée. 
Elle  contient  donc  tous  les  résidus  de  celle-ci  plus  les  résidus  engendrés  par 
la  protéolyse  qui  lui  a  donné  naissance. 

Tableau  1 

RÉSIDUS     TERMINAUX     DU     CHYMOTRYPSINOGÈNE 
ET    DES    CHYMOTRYPSINES    S    ET    a 


Chymotrypsinogène 
Chymotrypsine-8 
Chymotrypsine- a 

Résidus  N-terminaux 

Résidus  C-terminaux 

Cys 

Cys,  lieu 
Cys,  lieu,  Ala 

Leu 

Leu,  Leu 
Leu,  Leu,  Tyr 

La  présence  d'un  résidu  C-terminal  de  tyrosine  dans  la  chymotrypsine- a 
suggère  que  les  proteolyses  additionnelles  de  l'activation  'lente'  sont  dues 
à  la  chymotrypsine.  La  lenteur  du  phénomène  laisse  d'ailleurs  le  temps  au 
chymotrypsinogène  d'être  attaqué  par  la  chymotrypsine  et  à  la  chymo- 
trypsine-S, produit  final  de  l'activation  'rapide',  le  temps  de  s'autolyser.  Ces 
deux  phénomènes  existent.  Ils  donnent  tous  les  deux  naissance  à  un  résidu 
de  threonine  en  position  N-terminale.^  Ce  résidu  de  threonine  ne  fait  d'ail- 
leurs qu'une  apparition  relativement  brève.  Il  cède  bientôt  la  place  à  un 
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résidu  d'alanine,  lequel  se  trouve  justement  en  position  N-terminale  dans  la 
chymotrypsine-a  (tableau  1).  Une  telle  coincidence  n'est  vraisemblablement 
pas  fortuite.  Aussi,  avons-nous  cherché  à  mieux  caractériser  les  protéines 
résultant  de  l'attaque  chymotrypsique  du  chymotrypsinogène.  En  utilisant 
les  techniques  classiques,  nous  avons  réussi  à  cristalliser  deux  de  ces  pro- 
téines, l'une  contenant  la  threonine  N-terminale  et  l'autre,  l'alanine.  Nous 
avons  également  démontré  la  formation  effective  de  la  protéine  contenant 
l'alanine  et  la  serine  (voir  fig.  1)  N-terminales.  Ces  protéines  n'ont  pas  été 
obtenues  à  l'état  pur  car  la  cristaUisation  ne  possède  pas  ici  une  efficacité 
bien  considerable.*  Mais  les  protéines  en  question  ont  été  malgré  tout  pré- 
parées dans  un  état  suffisamment  défini  pour  permettre  la  réalisation  de 
quelques  expériences  significatives. 

Ces  protéines  sont  inactives.  Nous  ne  nous  en  étonnerons  pas  car  le  chymo- 
trypsinogène n'est  pas  activé  par  la  chymotrypsine  et  nous  savons  déjà  que 
l'on  active  pas  ce  zymogène  en  rompant  n'importe  quelle  liaison.  Mais  elles 
sont  encore  activables  par  la  trypsine  et  nous  les  avons  pour  cette  raison 
appelées  'néochymotrypsinogènes'.  Cette  activation  est  identique  sur  le  plan 
chimique  à  celle  du  chymotrypsinogène  ordinaire.  Elle  s'effectue  grâce  à 
l'ouverture  de  la  même  liaison  arginyl-isoleucine.  Mais  sa  vitesse  et  ses 
résultats  sont  bien  différents.  La  fig.  3  permet  d'étudier  l'apparition  de 
l'activité  chymotrypsique  quand  on  active  dans  les  mêmes  conditions  le 
chymotrypsinogène  (courbe  de  gauche)  et  le  néochymotrypsinogène  à  threo- 
nine N-terminale  (courbe  de  droite). 

Deux  remarques  peuvent  être  faites  à  propos  des  courbes  de  la  fig.  3. 
D'une  part,  l'activation  du  néochymotrypsinogène  est  nettement  plus  lente 
que  celle  du  chymotrypsinogène.  On  sait  que  la  liaison  arginyl-isoleucine 
manifeste  au  sein  du  chymotrypsinogène  une  vulnérabilité  vraiment  extra- 
ordinaire vis-à-vis  de  la  trypsine  puiqu'elle  est  coupée  dans  toutes  les  molé- 
cules après  quelques  minutes  à  0°C.  Cette  vulnérabilité  est  certainement  due 
à  un  élément  particulier  de  structure  que  le  chymotrypsinogène  possède  à 
un  très  haut  degré.  L'élément  en  question  est  loin  de  disparaître  au  moment 
des  ruptures  chymotrypsiques.  Mais  il  est  atténué  de  façon  sensible.  D'autre 
part,  l'activité  potentielle  du  néochymotrypsinogène,  c'est  à  dire  l'activité 
chymotrypsique  maximum  que  peut  acquérir  cette  protéine  dans  ces  conditions 
déterminées,  est  plus  faible  que  celle  du  chymotrypsinogène (2-4  au  lieude4-0). 
Les  ruptures  chymotrypsiques  n'empêchent  donc  pas  la  formation  du  centre 
actif.  Mais  elles  changent  vraisemblablement  sa  structure  dans  un  sens 
défavorable,  en  modifiant  par  exemple  les  distances  entre  les  divers  groupe- 
ments constituant  le  centre.  En  fait,  les  néochymotrypsinogènes  donnent 
naissance  aux  chymotrypsines  de  l'activation  'lente'  (chymotrypsines  de  type 
a)  dont  l'activité  spécifique  est  inférieure  à  celle  des  chymotrypsines-7r  et  8. 
Tout  semble  ainsi  se  passer  comme  si  les  proteolyses  limitées  qu'effectue 

*  La  Chromatographie  ne  permet  pas  non  plus,  comme  nous  allons  le  voir  tout  à 
l'heure,  de  purifier  complètement  les  protéines  en  question. 
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la  chymotrypsine  au  sein  du  chymotrypsinogène  ne  bouleversent  pas  suffise- 
ment  la  structure  de  la  molécule  pour  lui  faire  perdre  son  caractère  de  zymo- 
gène.  Mais  elles  atténuent  néanmoins  ce  caractère  à  plusieurs  points  de  vue. 
La  substitution  précédemment  constatée  de  la  threonine  N-terminale  par 
Talanine  suggère  en  outre  que  l'attaque  chymotrypsique  du  chymotryp- 
sinogène comporte  à  un  moment  donné  la  libération  de  un  ou  de  plusieurs 
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Fig.  3.  Activation  du  chymotrypsinogène  et  du  néochymotrypsinogène  à 
N-terminale. 

Courbe  de  gauche:  activation  du  chymotrypsinogène. 

Courbe  de  droite:  activation  du  néochymotrypsinogène  à  threonine  N-terminale. 

Ordonnées:  activités  spécifiques  des  hydrolysats  trypsiques  vis  à  vis  de  l'acétyl-L- 
tyrosine  éthylester.  Conditions  de  l'activation:  zymogène:  5  mg  par  ml.  Rapport  pondéral 
enzyme  substrat:  l/40.pH  =  7-6.  Temp.:  0°C. 


peptides.  La  fig.  4  (diagramme  du  haut)  montre  que  le  peptide  est  unique* 
et  que  sa  structure  est  celle  de  la  thréonylasparagine. 

L'identification  de  la  thréonylasparagine  nous  éclaire  sur  le  mécanisme 
de  la  genèse  des  néochymotrypsinogènes.  Ce  mécanisme  est  reproduit  dans 
la  fig.  5. 

Comme  on  le  voit,  ce  mécanisme  est  curieusement  semblable  à  celui  de 
l'activation  'rapide'  schématisé  dans  la  fig.  1.  Il  comporte  en  effet  la  rupture 
de  deux  liaisons  et  la  libération  d'un  dipeptide.  La  première  liaison  rompue 

*  Une  enquête  très  approfondie  a  été  faite  en  vue  de  savoir  si  la  thréonylasparagine  est 
vraiment  le  seul  peptide  libéré.  Il  n'existe  ni  peptides  non-réactifs  vis  à  vis  de  la  ninhydrine, 
ni  peptides  contenant  de  la  cystine  ou  du  tryptophane  (lesquels,  on  le  sait,  migrent  mal 
dans  les  colonnes  de  Dowex-50). 


THQ.ASPf.-iH^] 


THß  AipfhH^] 


vyivvO  I.-a/V\A\^ 


-75^ 


da  o^N  pH  i.25     à    iN    pH  5.19- 


Fig.  4.  Peptides  engendrés  par  l'attaque  chymotrypsique  du  chymotrypsinogène  et  par 
l'activation  'lente'  de  ce  zymogène  (11). 

Diagramme  du  haut:  attaque  du  chymotrypsinogène  (4-4x10~3m)  par  la  chymotryp- 
sine-a  (0-45  x  IQ-^m)  à  0°C,  pH  =  7-6  pendant  46  h. 

Diagramme  du  bas:  Activation  'lente'  du  chymotrypsinogène  (4-4 x  10~%)  par  la  tryp- 
sine  (4-8  x  IO^'^m)  en  présence  de  S04Am2  0-3m  à  6°C,  pH  =  7-6  pendant  46  h. 

Technique  de  Moore  et  Stein  avec  gradient  de  pH  et  de  force  ionique.  Colonne  de 
87x0-9  cm.  Volume  de  la  chambre  de  mélange:  330  ml.  Débit  8  ml/h.  Volume  des  frac- 
tions: 1  ml. 
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Chymotrypsinogène       Nèo -Thr  Nèo-Ala 

Fig.  5.  Schéma  des  proteolyses  chymotrypsiques  donnant  naissance  aux  néochymo- 
trypsinogènes.  Les  proteolyses  en  question  ont  été  placées  dans  la  partie  'gauche'  de  la 
molécule,  par  opposition  à  la  partie  'droite'  où  se  produisent  les  proteolyses  de  l'activa- 
tion 'rapide'  (fig.  1).  Cette  convention  est  tout  à  fait  arbitraire.  L'essentiel  est  que  les 
deux  groupes  de  proteolyses  se  fassent  de  part  et  d'autre  d'un  pont  interne  matérialisé 
dans  le  schéma  par  un  trait  en  pointillé.  Sinon,  la  molécule  se  séparerait  en  plusieurs 
morceaux. 


Lps 
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(liaison  3  :  tyrosyl-thréonine)  correspond  bien  à  la  spécificité  de  la  chymo- 
trypsine.  La  seconde  rupture  est  un  peu  plus  surprenante  puisqu'elle  s'effectue 
à  coté  d'un  résidu  d'asparagine  pour  lequel  la  chymotrypsine  ne  semble  pas 
avoir  d'affinité  particulière.  Mais  cette  rupture  est  plus  lente  que  la  première. 
Il  est  en  outre  intéressant  de  constater  que  la  présence  de  sulfate  d'ammonium 
l'accélère  beaucoup.*  On  sait  que  les  ions  calcium  accélèrent  également  la 
Proteolyse  trypsique  d'une  liaison  lysyl-isoleucine  dans  le  trypsinogène  et 
qu'ils  favorisent  ainsi  l'activation  de  ce  zymogène.^^  L'influence  exercée  par 
certains  ions  métalliques  sur  les  phénomènes  de  protéolyse  est  bien  connue. 
Le  fait  nouveau  est  que  cette  influence  puisse  dans  les  deux  examples  pré- 
cédents être  localisée  au  niveau  d'une  liaison  déterminée.  Une  telle  spécificité 
est-elle  due  à  des  interactions  de  l'ion  avec  le  substrat,  modifiant  sa  forme 
ou  attirant  l'enzyme  vers  un  point  précis?  L'ion  agit-il  au  contraire  sur 
l'enzyme  en  modifiant  son  affinité  et  certains  de  ses  propriétés?  Des  ex- 
périences en  cours  au  laboratoire  permettront  sans  doute  de  répondre  bien- 
tôt à  ces  importantes  questions. 

Le  diagramme  du  bas  de  la  fig.  4  est  relatif  aux  peptides  engendrés  pendant 
l'activation  'lente'  du  chymotrypsinogène.  A  première  vue,  ce  diagramme 
semble  assez  compliqué.  Il  est  en  fait  aussi  simple  que  le  premier  car  les 
deux  pics  centraux  A  et  B  correspondent,  l'un  à  l'ammoniaquef  et  l'autre, 
à  un  mélange  de  plusieurs  peptides  dont  les  proportions  individuelles  sont 
très  faibles.  Nous  n'avons  donc  à  considérer  que  deux  pics  significatifs:  le 
pic  de  la  sérylarginine  qui  montre  que  le  principe  de  l'activation  'lente'  est 
le  même  que  celui  de  l'activation  'rapide'  et  le  pic  de  la  thréonylasparagine, 
qui  montre  que  les  proteolyses  additionnelles  de  l'activitation  'lente'  sont 
dues  à  l'attaque  chymotrypsique  du  chymotrypsinogène  ou/et  à  l'autolyse 
de  la  chymotrypsine-S, 

Le  caractère  mixte  du  processus  de  l'activation  'lente'  apparaît  ainsi  en 
toute  clarté.  Il  comporte  la  rupture  d'une  liaison  par  la  trypsine  et  la  rup- 
ture de  trois  liaisons  (au  maximum)  par  la  chymotrypsine.  Quand  on  coupe 
successivement  quatre  liaisons  dans  un  ordre  quelconque,  2*  =  16  possi- 
bilités sont  en  principe  offertes.  La  première  correspond  à  l'état  initial.  Les 
quinze  autres  sont  trouvées  en  associant  1  par  1,  2  par  2,  3  par  3  et  4  par  4 
les  quatre  ruptures  envisagées.  La  fig.  6  donne  un  aperçu  de  ces  quinze 
possibihtés. 

Les  quinze  possibilités  mentionnées  dans  la  fig.  6  ne  sont  d'ailleurs  à 
considérer  que  si  les  quatre  ruptures  interviennent  dans  un  ordre  quel-conque. 

*  Il  est  curieux  de  noter  que  Kunitz  et  Northrop  ont  obtenu  la  chymotrypsine- a  parce- 
qu'ils  ont  activé  un  cake  de  chymotrypsinogène  contenant  50%  de  sulfate  d'ammonium. 
En  activant  du  chymotrypsinogène  dépourvu  de  sel,  ils  auraient  principalement  obtenu 
une  autre  chymotrypsine  de  type  a,  la  chymotrypsine-aj,  contenant  de  la  threonine  en 
position  N-terminale. 

f  Comme  il  vient  d'être  dit,  l'activation  'lente'  est  réalisée  en  présence  d'une  quantité 
très  substantielle  de  sulfate  d'ammonium.  Les  traitements  classiques  de  dessalage  n'arrivent 
pas  à  éliminer  la  totalité  des  ions  ammonium  apportés  par  ce  sel. 
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Pendant  l'activation  'rapide',  la  rupture  trypsique  de  la  liaison  1  est 
quasi-immédiate  et  elle  est  suivie  à  brève  échéance  par  l'autolyse  de  la 
liaison  2.  Le  phénomène  est  donc  simple.  Il  consiste  à  convertir  le  chymo- 
trypsinogène  successivement  en  deux  enzymes  (tt  et  8)  dont  le  dernier  jouit 
d'une  relative  stabilité.  Pendant  l'activation  'lente'  au  contraire,  la  lenteur 
des  réactions  introduit  un  élément  de  désordre  et  de  complexité.  D'une  part, 


Chymotrypsines 


m 


\  2 

Neochymotrypsinogenes  )  [T]    [  34  | 


13       14 


124     134 


c< 

1234 


23       24     |234| 


Fig.  6.  Possibilités  offertes  par  la  rupture  de  quatre  liaisons  dans  le  chymotrypsinogène. 
Les  cadres  indiquent  les  protéines  d'ores  et  déjà  caractérisées:  quatre  chymotrypsines, 
dont  deux  engendrées  par  l'activation  'rapide'  (77  et  S)  et  deux  engendrées  par  l'activation 
'lente'  (a  et  oj);  et  trois  neochymotrypsinogenes  contenant  (en  plus  du  résidu  de  demi- 
cystine),  les  résidus  N-terminaux  suivants:  threonine  (3),  alanine  (34),  alanine  et  serine 
(234.) 

l'hydrolyse  trypsique  de  la  liaison  1  n'est  plus  forcément  terminée  avant  le 
début  des  hydrolyses  chymotrypsiques.  D'autre  part,  ces  hydrolyses  chymo- 
trypsiques  ne  s'effectuent  pas  forcément  selon  un  ordre  immuable.  Les 
mélanges  engendrés  sont  donc  relativement  complexes.  On  sait  que  la 
chymotrypsine-a  s'obtient  avec  un  rendement  assez  mauvais  et  qu'elle  n'est 
pas  encore  électrophorétiquement  homogène  après  de  nombreuses  cristal- 
lisations. 

Dès  que  la  liaison  1  est  rompue,  les  molécules  sont  actives.  Les  huit  pro- 
téines de  la  ligne  du  haut  sont  donc  des  chymotrypsines.  Les  deux  premières 
sont  les  chymotrypsines-TT  et  8  de  l'activation  'rapide'.  Les  six  autres  sont 
en  principle  du  domaine  de  l'activation  'lente'.  Mais  elles  n'ont  peut-être 
pas  toutes  une  existence  réelle.  La  dernière  est  la  chymotrypsine-a  dont  la 
formation  exige  le  nombre  maximum  de  ruptures  (1234).  La  protéine  123 
est  la  chymotrypsine-ai  dont  l'existence  est  rendue  probable  par  la  présence 
de  threonine  N-terminale  dans  les  hydrolysats  de  l'activation  'lente'. 

Toutes  les  autres  possibilités  correspondent  en  principe  à  des  neochymo- 
trypsinogenes. Parmi  ceux-ci,  deux  ont  été  purifiés  (3  et  34)  et  un  a  été 
caractérisé  (234).  Rien  n'indique  pour  l'instant  que  la  vitesse  relative  des 
ruptures  permet  aux  autres  neochymotrypsinogenes  de  se  former  en  quan- 
tités décelables. 
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III.    ETUDE    PRÉLIMINAIRE    DE    L'ATTAQUE    DU 

CHYMOTRYPSINOGÈNE    PAR    LA 

LEUCYLAMINOPEPTIDASE 

Les  proteolyses  précédentes  sont  assez  simples  pour  que  leur  mécanisme 
puisse  être  éclairci  de  façon  complète.  Mais,  l'endroit  précis  où  elles  se 
produisent,  c'est  à  dire  leur  position  absolue  et  relative  dans  la  molécule 
nous  restera  inconnu  aussi  longtemps  que  l'ordre  d'enchaînement  des  résidus 
de  cette  molécule  n'aura  pas  été  déterminé.  En  attendant  que  ce  travail  ait 
été  réalisé,  il  paraît  intéressant  d'attaquer  le  chymotrypsinogène  par  des 
enzymes  doués  d'une  spécificité  précise  de  position  et  d'observer  les  réper- 
cussions de  cette  attaque  sur  l'activité  potentielle  du  zymogène.  Nous  avons 
dans  ce  but  utilisé  la  leucylaminopeptidase  du  rein  récemment  purifiée  par 
Smith  et  al.^^  On  sait  que  cet  enzyme  agit  de  préférence  sur  les  peptides. 
Mais  il  attaque  aussi  certaines  protéines  à  partir  de  leurs  extrémités  N- 
terminales.^*'^^ 

Le  chymotrypsinogène  contient  un  résidu  N-terminal  de  demi-cystine.^° 
Ce  résidu  n'est  pas  simplement  relié  au  reste  de  la  molécule  par  son  pont 
disulfure  (schéma  A)  car  il  suffirait  alors  d'oxyder  le  dinitrophényl  (DNP) 
chymotrypsinogène  par  l'acide  performique  pour  obtenir  de  l'acide  DNP- 
cystéique  libre.  Or,  ce  dérivé  n'est  obtenu  qu'après  oxydation  et  hydrolyse. 
Le  résidu  de  demi-cystine  ne  se  trouve  pas  non  plus,  comme  on  l'a  sug- 
géré,^^  en  position  latérale  sur  un  groupe  e — NH2  de  la  lysine  (schéma  B). 

Cy Lys S— 

I  1  I 

S  NH  NHgCy.Gly— 

I  I 

S  CO 

I  NHaCys 

NHaCyCOOH 

ABC 

L'expérience  suivante  montre  qu'il  est  situé  à  coté  d'un  résidu  de  glyco- 
colle  (schéma  C):  Du  DNP-chymotrypsinogène  oxydé  par  l'acide  per- 
formique est  hydrolyse  pendant  30  min.  dans  HCl  6  N  bouillant.  Le  résidu 
solide  est  hydrolyse  à  nouveau  dans  les  mêmes  conditions.  Les  fractions 
solubles  obtenues  après  trois  opérations  semblables  sont  évaporées  et  traitées 
par  le  Dowex-50  x  8  forme  H+.^°  Les  substances  non-adsorbées  sont  chroma- 
tographiées  sur  papier  dans  le  système  butanol-acide  formique.  La  seule 
tache  jaune  obtenue,  dont  le  Rf  est  identique  à  celui  de  l'acide  DNP- 
cystéique,  est  chromatographiée  à  nouveau  à  l'aide  de  méthyléthylcétone 
saturée  d'eau  ou  de  phénol  tamponné  à  pH  4.  On  obtient  dans  les  deux  cas 
deux  taches  jaunes  dont  l'une  migre  comme  l'acide  DNP-cystéique  et  l'autre 
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migre  un  peu  plus  vite.  Cette  dernière  fournit  par  hydrolyse  des  quantités 
stoechiométriquement  égales  d'acide  DNP-cystéique  et  de  glycocolle.  Il  s'agit 
donc  du  peptide  DNP-Cys(S03H).Gly.  Le  chymotrypsinogène  possède  ainsi 
une  véritable  extrémité  N-terminale  Cys.Gly — .  Cette  extrémité  peut  en 
principe  constituer  un  point  d'attaque  pour  l'aminopeptidase.  Le  tableau  2 
résume  les  principaux  résultats  obtenus. 

RÉSULTATS    PRÉLIMINAIRES    CONCERNANT    L'ATTAQUE    DU 
CHYMOTRYPSINOGENE    PAR    LA    LEUCYLAMINOPEPTIDASE 


ChTg:  chymotrypsinogène 


Aminopeptidase 

ChTg  cristallisé:  (C) 

ChTg  spécialement 

purifié:  (P) 

Durée  de 

l'expérience 

(h.) 

Quantités 

d'amino 

acides 

libérés 

Activité 
spécifique 

Nb  d'unités  par 
/i-mole  de  ChTg 

38 
51 
51 
51 

92 
90-100 

4 
2 

003 
0-8 

2 
9 

C 
C 
C 

p 

p 
P 

1/4 

1/2 

1/4 

1/4 

1 

18 

2 

5 

2 

23 

+  +  + 

+  +  + 

+  + 

0 

+ 
+  +  + 

0 

+ 

0 

+ 

Les  premières  expériences  ont  été  réalisées  avec  des  préparations  d'amino- 
peptidase  purifiées  par  précipitations  fractionnées.  Leurs  activités  spécifiques 
vis-à-vis  de  la  L-leucine  amide  étaient  comprises  entre  30  et  50.  Le  chymo- 
trypsinogène était  un  échantillon  commercial,  cristallisé  trois  fois  au  labora- 
toire selon  les  techniques  classiques,  lyophilisé  et  conservé  à  —  10°C  pendant 
quelques  mois.  Les  premières  expériences  réalisées  dans  ces  conditions  don- 
nent l'impression  que  le  chymotrypsinogène  est  bien  attaqué  par  l'amino- 
peptidase. 0-03  unité  d'enzyme  par  ju,  mole  de  substrat  libère  au  bout  d'un 
quart  d'heure  des  quantités  déjà  considérables  d'amino  acides.  Toutefois, 
en  analysant  ultérieurement  notre  échantillon  de  chymotrypsinogène,  nous 
y  avons  trouvé  0-2  résidu  de  threonine  N-terminale  par  mole.  Bien  qu'il 
ait  été  gardé  constamment  à  —  10°C,  cet  échantillon  contenait  donc  20% 
de  néochymotrypsinogène  vraisemblablement  formé  par  un  peu  de  chymo- 
trypsine  provenant  elle-même  d'une  trace  de  trypsine. 

Or,  cette  hétérogénéité  grossière  pouvait  difficilement  être  mise  en  évidence 
par  les  techniques  classiques  de  la  dynamique  moléculaire.  Au  moment  où 
nous  avons  préparé  les  néochymotrypsinogènes,  nous  nous  sommes  en  efiet 
efforcés  de  séparer  ces  substances  du  chymotrypsinogène  ordinaire  par 
Chromatographie  sur  Amberlite  IRC-50.  Les  essais  ont  été  infructueux.  Dans 
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toute  la  zone  des  pH  admissibles,  la  vitesse  de  migration  a  été  la  même. 
L'échantillon  en  cause  aurait  donc  fourni  un  pic  parfaitement  symétrique 
dans  une  chromatographic  de  ce  type.  En  outre,  l'électrophorèse  n'est  vrai- 
ment efficace  dans  le  cas  de  protéines  aussi  voisines  qu'à  la  condition  que 
l'un  des  constituants  du  mélange  possède  une  polarité  nettement  différente 
des  autres.  L'unique  raison  du  succès  rencontré  par  Bettelheim  et  Neurath^ 
dans  la  séparation  électrophorétique  des  chymotrypsines-7r  et  8  est  la  pré- 
sence au  sein  de  la  première  d'un  résidu  supplémentaire  d'arginine.  Dans 
le  cas  actuel,  le  néochymotrypsinogène  à  threonine  N-terminale  possède 
exactement  la  même  composition  que  le  chymotrypsinogène  et  le  néochymo- 
trypsinogène à  alanine  N-terminale  n'en  diffère  que  par  deux  résidus  (threo- 
nine et  asparagine)  dont  la  polarité  est  très  faible.  La  seule  méthode  vraiment 
efficace  pour  déceler  l'hétérogénéité  de  l'échantillon  était  donc  la  méthode 
chimique  de  détermination  des  groupements  terminaux.  Le  fait  est  digne 
d'être  noté  car  il  peut  se  reproduire  dans  d'autres  cas. 

La  présence  de  néochymotrypsinogènes  dans  le  chymotrypsinogène  n'est 
d'ailleurs  pas  gênante  quand  on  se  propose  de  déterminer  l'ordre  d'enchaine- 
ment  des  résidus.  Cette  étude  exige  en  effet  que  l'on  rompe  de  nombreuses 
liaisons  et  il  importe  peu  que  une  ou  deux  liaisons  soient  déjà  rompues 
dans  certaines  des  molécules  utihsées.  Mais,  quand  on  veut  étudier  la  struc- 
ture globale  du  chymotrypsinogène  ou  l'action  d'une  exopeptidase  sur  cette 
protéine,  une  telle  hétérogénéité  peut  provoquer  des  erreurs  très  graves.  En 
somme,  le  chymotrypsinogène  est  beaucoup  plus  stable  que  le  trypsinogène 
et  le  pepsinogène  parce  que  son  activation  n'est  pas  autocatalytique.  Mais, 
il  est  peu  raisonnable  d'attendre  que  le  précurseur  d'un  enzyme  protéolytique, 
si  aisément  activé  par  la  trypsine  et  attaqué  lentement  par  son  propre  enzyme, 
puisse  jouir  d'une  parfaite  stabilité. 

Quoi  qu'il  en  soit,  nous  avons  réussi  à  préparer  un  échantillon  de  chymo- 
trypsinogène complètement  dépourvu  de  résidus  terminaux  parasites  (autres 
que  le  résidu  de  demi-cystine)  en  procédant  à  plusieurs  cristalhsations  en 
présence  d'une  quantité  substantielle  de  diwopropylfluorophosphate.  Cet 
échantillon,  dont  la  stabiUté  paraît  assez  satisfaisante,  est  désigné  dans  le 
tableau  2  par  la  lettre  P.  Il  libère  au  contact  de  l'aminopeptidase  beaucoup 
moins  d'amino  acides  que  le  premier. 

D'autres  expériences  ont  été  réalisées  avec  des  préparations  d'amino- 
peptidase  spécialement  purifiées  par  électrophorèse  de  zone  dans  une  colonne 
d'amidon.^'  Leur  activité  spécifique  atteignait  100,  c'est  à  dire  la  valeur  la 
plus  haute  enregistrée  jusqu'ici.  On  voit  que  cette  nouvelle  préparation  est 
légèrement  moins  active  que  les  anciennes  sur  l'échantillon  pur  de  chymo- 
trypsinogène. Il  n'est  donc  pas  toujours  prudent  de  faire  agir  sur  les  pro- 
téines, comme  on  le  conseille  parfois,  des  préparations  d'aminopeptidase 
dont  l'activité  spécifique  n'excède  pas  40. 

Enfin,  dans  les  dernières  expériences  du  tableau  2,  une  aminopeptidase 
aussi  pure  que  possible  a  agi  sur  un  chymotrypsinogène  spécialement  purifié. 
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9  unités  d'enzyme  par  /x-mole  de  substrat  ont  alors  engendré  au  bout  de 
23  h.  environ  0-5  mole/mole  de  7  ou  8  amino  acides,  parmi  lesquels  se  trou- 
vait le  glycocolle  qui  occupe  la  deuxième  position  dans  la  séquence  N- 
terminale.  On  pouvait  donc  avoir  l'espoir  d'avoir  attaqué  le  chymotryp- 
sinogène  à  l'endroit  attendu.  Toutefois,  si  la  liaison  cystyl-glycocolle  avait 
été  réellement  rompue  par  l'aminopeptidase,  le  résidu  de  demi-cystine  n'aurait 
plus  été  attaché  que  par  son  pont  disulfure  (voir  plus  haut)  et  il  aurait  été 
susceptible  d'être  détaché  par  une  simple  oxydation.  En  fait,  l'attaque  en- 
zymatique  ne  modifie  en  rien  le  comportement  de  ce  résidu. 

Le  fait  surprenant  n'est  d'ailleurs  pas  que  l'aminopeptidase  éprouve 
quelques  difficultés  à  attaquer  la  séquence  N-terminale  du  chymotrypsino- 
gène.  Cette  séquence  se  compose  en  effet  d'un  résidu  un  peu  particulier  de 
demi-cystine  et  d'un  résidu  de  glycocolle  pour  lequel  l'enzyme  ne  semble 
pas  posséder  beaucoup  d'affinité.  Des  expériences  ultérieurs  nous  diront  si 
la  séquence  en  question  peut  être  attaquée  dans  d'autres  conditions  et  si 
les  séquences  des  néochymotrypsinogènes  et  des  chymotrypsines  sont  plus 
accessibles.  Mais,  malgré  leur  caractère  négatif,  nos  observations  montrent 
dès  maintenant  avec  quelle  prudence  il  faut  interpréter  les  résultats  fournis 
par  l'aminopeptidase,  surtout  quand  il  s'agit  d'expériences  de  longue  durée, 
réalisées  sur  des  protéines  dont  la  stabilité  n'est  pas  absolue. 

RÉSUMÉ 

Le  présent  mémoire  décrit  en  détail  les  proteolyses  limitées  subies  par  le 
chymotrypsinogène  de  boeuf  au  cours  de  ses  activations  'rapide'  et  'lente'. 
Il  donne  aussi  quelques  indications  préliminaires  concernant  l'action  de  la 
leucylaminopeptidase  sur  ce  zymogène. 

1.  L'activation  du  zymogène  résulte  de  la  rupture  d'une  seule  liaison 
reliant  un  résidu  d'arginine  à  un  résidu  d'isoleucine.  La  chymotrypsine-7r 
ainsi  engendrée  s'autolyse  au  niveau  d'une  liaison  leucyl-sérine,  en  donnant 
la  chymotrypsine-S  dont  l'activité  est  identique  et  un  dipeptide,  la  séryl- 
arginine.  Le  phénomène  est  très  spécifique.  Chaque  enzyme  coupe  une  seule 
liaison  à  concurrence  de  95%.  La  rupture  de  la  liaison  leucyl-sérine,  laquelle 
est  séparée  de  la  liaison  arginyl-isoleucine  par  deux  résidus  seulement,  ne 
provoque  aucune  activation. 

2.  Pendant  l'activation  'lente',  deux  processus  supplémentaires  ont  le 
temps  de  se  dérouler:  l'attaque  chymotrypsique  du  chymotrypsinogène  et 
l'autolyse  de  la  chymotrypsine-S.  Ces  deux  processus  provoquent  la  rup- 
ture des  deux  mêmes  liaisons  (tyrosyl-thréonine  et  asparaginyl-alanine)  et 
la  libération  d'un  dipeptide  (la  thréonylasparagine).  Les  nouvelles  protéines 
ainsi  engendrées  (néochymotrypsinogènes)  sont  encore  activables  par  la 
trypsine.  Mais  leur  activation  est  plus  lente  et  les  chymotrypsines  formées 
sont  moins  actives. 

3.  Etant  le  précurseur  d'un  enzyme  protéoly tique  et  étant  extrêmement 
vulnérable  vis-à-vis  de  la  trypsine,  le  chymotrypsinogène  ne  paraît  pas  doué 
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d'une  parfaite  stabilité.  La  présence  de  quantités  considérables  de  néochymo- 
trypsinogènes  dans  le  chymotrypsinogène  ne  semble  pas  pouvoir  être  décelée 
par  les  techniques  classiques  de  la  dynamique  moléculaire. 

4.  Malgré  toutes  les  précautions  prises  pour  purifier  le  chymotrypsinogène 
et  la  leucylaminopeptidase,  cet  enzyme  libère  des  quantités  très  notables 
d'amino  acides  qui  ne  proviennent  pas  de  la  séquence  N-terminale  de  la 
molécule.  Cette  séquence  est  Cys.Gly — . 
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Studies  on  the  autocatalytic  activation  of 
trypsinogen 

J.  F.  PECHÈRE  AND  HANS  NEURATH 

Department  of  Biochemistry,  University  of  Washington,  Seattle,  Washington 


Recent  work  on  the  autocatalytic  conversion  of  trypsinogen  to  trypsin^ -^ 
has  shown  that  during  the  activation,  and  simultaneously  with  the  appear- 
ance of  tryptic  activity,  a  lysyl-isoleucine  bond  is  broken  in  the  A^-terminal 
region  of  the  trypsinogen  molecule,  and  that  correspondingly  increasing 
amounts  of  a  hexapeptide,  having  the  structure  valyl-(aspartyl)4-lysine,  can 
be  isolated  from  the  activation  mixtures.  A  similar  parellelism  has  been 
found  to  exist  between  the  appearance  of  enzymatic  activity  and  a  decrease 
in  the  specific  levorotation  of  the  protein.^  While  the  correlation  between 
these  chemical,  physical  and  enzymatic  parameters  appears  to  be  well  estab- 
lished, the  interpretation  of  these  data  is  not  self-evident.  Thus,  the  ques- 
tion remains  how  the  removal  of  a  small  peptide  from  the  iV-terminal  region 
of  the  single  polypeptide  chain  of  trypsinogen  can  account  for  the  appear- 
ance of  enzymatic  activity.  In  consideration  of  this  problem,  attempts  were 
made  to  establish  additional  correlations  between  chemical  and  physical 
properties  and  to  look  for  other  events  which  would  help  to  provide  a 
molecular  interpretation  for  the  formation  of  an  enzyme  from  its  inactive 
precursor. 

Such  additional  correlations  have  been  recently  established  by  investi- 
gating the  following  properties  during  the  activation  of  trypsinogen  :  (a)  the 
appearance  of  trypsin,  in  the  form  of  soybean-trypsin  inhibitor  compound, 
as  determined  by  moving  boundary  electrophoresis  of  partial  activation 
mixtures,  and  (b)  measurements  of  the  base  consumption  during  activation, 
as  determined  in  the  Jacobsen-Léonis  autotitrator.  These  were  compared  with 

(c)  the  amount  of  hexapeptide,  as  isolated  chromatographically  from  the 
trichloroacetic  acid  (TCA)-soluble  fraction  of  activation  mixtures,^  and  with 

(d)  the  percentage  change  in  optical  rotation. ^  All  these  results  are  summarized 
in  Fig.  1 .  In  each  of  the  four  diagrams  shown  in  this  figure,  the  percentage 
change  in  property  during  activation  is  plotted  against  the  time  of  activa- 
tion; the  solid  line  represents  in  each  case  the  appearance  of  enzymatic 
activity  (benzoyl-L-arginine  ethyl  ester  as  substrate),  as  calculated  by  the 
Kunitz  equation^  from  experimental  points  (not  shown).  It  is  evident  that 
the  change  in  five  different  chemical  or  physical  properties  follows  exactly 
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the  same  rate  law,  indicating  that  they  are  all  manifestations  of  one  and  the 
same  molecular  event.  The  correlation  between  activity  and  amount  of  iso- 
lated hexapeptide  indicates  that  the  lysyl-isoleucine  bond  is  broken  simul- 
taneously with  activation.  The  correlation  between  activity  and  total  amount 
of  trypsin  appearing  indicates  that  no  intermediate  compound  having 
enzymatic  activity  is  formed  during  the  activation  process,  a  conclusion 
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Fig.  1.  Correlation  between  experimental  parameters  and  appearance  of  enzymatic 
activity  during  the  autocatalytic  activation  of  trypsinogen.  The  following  conditions 
applied  to  all  the  experiments:  trypsinogen  concentration  approx.  12-5  mg/ml.;  trypsino- 
gen/trypsin  ratio  20  :  1  ;  0-1  M  Tris  buffer  (pH  8  0)  containing  0-05  M-CaCl2;  temperature 
approx.  0°  C.  In  all  experiments  except  those  shown  on  the  left  top  quadrant,  indole  was 
also  present  to  minimize  the  action  of  a  slight  chymotryptic  impurity  which  was  found 
to  be  present  in  the  trypsinogen  and  trypsin  preparations  used. 

which  clearly  differentiates  the  activation  of  trypsinogen  from  that  of  chymo- 
trypsinogen.^  The  correlation  between  activity  and  change  in  optical  rota- 
tion supports  the  view  that  molecular  rearrangement  (toward  a  more  nearly 
helical  configuration)  is  an  essential  aspect  of  the  activation  process;  and 
the  correlation  between  enzymatic  activity  and  base  consumption  is  in 
accord  with  the  assumption  that  one  and  only  one  peptide  bond  is  broken 
during  the  reaction.  This  latter  conclusion  is  dependent  in  part  on  an  accu- 
rate knowledge  of  the  pK  of  the  amino  group  which  is  formed  during 
enzymatic  hydrolysis.  In  the  present  experiments,  a  pK  of  81  at  0°  was 
assumed,  whereas  a  pK  of  7-6  would  have  to  be  assumed  if  two  bonds  were 
broken  under  these  conditions.  It  is  also  recognized  that  the  hydrolysis  of 
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a  peptide  bond  involving  a  more  basic  amino  group,  such  as  the  e-amino 
group  of  lysine,  could  not  be  detected  by  these  measurements. 

Trypsinogen  contains  16  peptide  bonds  which,  on  the  basis  of  the  speci- 
ficity of  trypsin,  are  potentially  susceptible  to  hydrolysis  (14  lysyl  and 
2  arginyl  bonds).  In  order  to  exclude  the  possible  opening,  during  activa- 
tion, of  a  peptide  bond  involving  the  carboxyl  group  of  arginine,  acetylated 
trypsinogen  was  prepared,  by  the  use  of  acetic  anhydride,  and  subjected  to 
ion  exchange  chromatography.  There  appeared  to  be  an  inverse  relation 
between  the  degree  of  acetylation  and  the  degree  of  activation  of  tryp- 
sinogen, suggesting  that  fully  acetylated  trypsinogen  cannot  be  activated 
at  all. 

Whereas  during  activation  the  A^-terminal  valine  of  trypsinogen  is  re- 
placed by  an  isoleucine  group,'^  the  C-terminal  sequence  appears  to  be 
unaffected  during  this  process.  This  conclusion  is  based  primarily  on  the 
negative  evidence  that  the  C-terminal  group  of  trypsinogen  appears  to  re- 
main unreactive,  before  and  after  activation,  toward  carboxypeptidase, 
'basic  carboxypeptidase'  or  toward  hydrazinolysis.  The  lack  of  reactivity  of 
the  C-terminal  group  in  trypsinogen  and  activated  trypsinogen  is  a  structural 
feature  which  is  shared  by  chymotrypsinogen  as  well  and  which  permits 
various  interpretations,  as  shown  in  Fig.  2.  Structure  1  contains  a  peptide 
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Fig.  2.  C-terminal  structures  which  would  yield  negative  results  by  enzymatic  (carboxy- 
peptidases)  assay  or  by  the  hydrazinolysis  method. 

bond  between  the  C-terminal  carboxyl  group  and  an  e-amino  group  else- 
where on  a  side  chain.  Structure  2  involves  a  peptide  bond  between  the 
€-amino  group  of  a  C-terminal  lysine  and  a  side  chain  carboxyl  group  else- 
where along  the  polypeptide  chain.  Structure  3  has  a  C-terminal  cystine; 
and  structure  4,  a  C-terminal  a-amide.  All  of  these  structures  may  be 
expected  to  yield  negative  results  by  the  enzymatic  or  chemical  end-group 
analyses  previously  mentioned.  The  lack  of  reactivity  of  activated  trypsinogen 
toward  basic  carboxypeptidase  is  to  be  contrasted  with  recent  work  of 
Gladner,  Folk,  and  Laki®  who  have  reported  that  crystalline  trypsin  yields 
approximately  three  equivalents  of  lysine  when  incubated  with  this  enzyme. 
Since,  in  the  present  experiments,  end-group  analysis  was  carried  out  on 
freshly-prepared  activation  mixtures  as  soon  as  maximum  tryptic  activity  was 
reached,  it  seems  likely  that  during  the  process  of  isolation  of  crystaUine 
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trypsin  an  additional  lysyl-peptide  bond  had  been  hydrolyzed  by  trypsin- 
Because  of  the  specificity  requirements  of  trypsin,  these  negative  results  also 
seem  to  exclude  the  above  mentioned  possibiUty  of  hydrolysis  during  activa- 
tion of  a  peptide  bond  involving  the  e-amino  group  of  a  lysyl  residue. 

On  the  basis  of  all  evidence  considered,  therefore,  the  hydrolysis  of  the 
lysyl-isoleucine  bond  in  the  iV-terminal  region  of  the  single  polypeptide 
chain  of  trypsinogen  seems  to  be  the  sole  chemical  event  occurring  during 
autocatalytic  activation;  the  configurational  changes,  demonstrated  by 
measurements  of  the  optical  rotation,  must  then  be  considered  to  be  the 
consequence  of  this  hydrolytic  cleavage.  It  would  be  difficult  to  conceive  of 
a  mechanism  which  could  account  for  these  observations  if  trypsinogen  were 
to  consist  simply  of  an  open  polypeptide  chain.  However,  the  following  con- 
siderations suggest  that  the  molecule  is  not  of  such  a  simple  structure  :  (a)  the 
absence  of  a  reactive  C-terminal  group  ;  (b)  the  presence  of  6  intramolecular 
disulfide  bonds;  and  (c)  the  relatively  low  specific  levorotation  of  native 
trypsinogen  (  —  30°)  which  suggests  a  high  degree  of  helical  configuration. 

A  suggested  scheme'  representing  the  structural  changes  involved  in  acti- 
vation is  represented  in  Fig.  3,  which  incorporates  the  following  considera- 
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Fig.  3.  Proposed  scheme  for  the  conversion  of  trypsinogen  to  trypsin.  For  details  see 
the  text. 


tions:  (a)  The  C-terminal  region  of  the  molecule  is  not  involved  in  the 
activation  process  and,  hence,  is  omitted  from  this  diagram,  (b)  A  large 
fraction  of  the  trypsinogen  molecule  is  in  a  helical  configuration,  but  the 
A^-terminal  peptide  sequence  does  not  follow  the  helical  pattern,  presumably 
because  the  electrostatic  repulsion  between  the  four  adjacent  aspartic  acid 
residues  tends  to  keep  this  segment  in  a  more  or  less  extended  configuration. 


10]  ACTIVATION  OF  TRYPSINOGEN  173 

(c)  Interaction  between  the  aspartyl  carboxyl  groups,  electrostatically,  with 
positively  charged  groups,  or  by  hydrogen  bonding  with  groups  elsewhere 
along  the  chain,  provides  a  loop  which  is  destroyed  when  the  lysyl-isoleucine 
bond  is  broken  by  trypsin,  allowing  the  newly-formed  molecule  to  coil  up 
and  to  form  the  active  center  of  trypsin,  (d)  Two  of  the  six  disulfide  bridges 
of  trypsinogen  are  in  the  region  of  the  molecule  which  includes  the  catalytic 
site.  This  follows  from  studies  by  Dr  G.  H.  Dixon  in  our  laboratory  on  the 
tryptic  degradation  of  oxidized  diisopropylphosphoryl  (DlP)-trypsin,  which 
have  shown  that  the  largest  DIP-peptide  contained  five  cysteic  acid  residues 
among  a  total  of  55  amino-acid  residues,  (e)  The  active  center,  dehneated 
here  by  the  dotted  line,  includes  besides  the  catalytic  site,  the  grouping  con- 
ferring side-chain  specificity  to  the  enzyme,  as  denoted  by  X.  (/)  The  cata- 
lytic site  involves  a  histidine  and  a  serine  residue.  These  two  groupings  are 
separated  from  each  other  in  trypsinogen,  but  upon  activation,  come  suffi- 
ciently close  to  become  hydrogen-bonded  to  one  another.^ -^  This  conclu- 
sion is  based  primarily  on  kinetic  studies  of  the  reaction  of  trypsin  and 
chymotrypsin  with  acylating  agents,  such  as  /7-nitrophenyl  acetate  (reviewed 
in  ref.  9). 

An  essential  feature  of  this  scheme  is  that  the  splitting  of  the  lysyl-isoleucine 
bond  removes  a  structural  impediment,  which  allows  the  A^-terminal  sequence 
of  the  newly-formed  molecule  to  coil  up,  thereby  bringing  the  histidine  and 
serine  residues  into  juxtaposition.  According  to  this  picture,  cleavage  of  a 
bond  to  the  left  of  the  lysine  residue  might  not  cause  activation,  as  electro- 
static interaction  or  hydrogen  bonding  of  the  lysine  side  chain  would  still 
provide  a  certain  element  of  structural  rigidity  and  prevent  formation  of 
the  helix.  On  the  other  hand,  the  splitting  of  any  bond  to  the  right  of  the 
lysine  side  chain  would  produce  the  same  result  as  the  splitting  of  the  lysyl- 
isoleucine  bond.  According  to  this  interpretation,  the  structural  contribution 
of  two  or  more  hydrogen  bonds,  or  salt  linkages,  is  an  important  factor  in 
the  conversion  of  the  zymogen  to  the  active  enzyme,  and  were  it  possible 
to  rupture  these  bonds  specifically,  activation  of  trypsinogen  might  ensue. 
This  has  not  proved  possible  by  the  simple  expedient  of  denaturation  of 
trypsinogen  by  urea  ;  in  fact,  whereas  trypsin  is  still  active  in  3m  urea,  tryp- 
sinogen can  no  longer  be  activated  under  these  conditions,  even  by  the 
addition  of  trypsin,  presumably  because  other  structurally  important  hydro- 
gen bonds  are  dissolved  at  the  same  time. 

The  highly  specific  action  of  trypsin  in  hydrolysing  the  lysyl-isoleucine 
bond  may,  therefore,  be  viewed  as  both  a  case  of  limited  proteolysis  and  of 
limited  hydrogen-bond  rupture.  The  question  as  to  whether  the  hexapeptide 
is  actually  liberated  during  this  process  or  whether  it  remains  attached  and 
is  only  freed  after  addition  of  trichloroacetic  acid  has  not  yet  been  answered 
in  an  unequivocal  way,  but  all  evidence  thus  far  available  suggests  strongly 
that  it  is  liberated  as  the  lysyl-isoleucine  bond  is  broken. 

Details  of  the  present  results  will  be  pubhshed  elsewhere.^'' 
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Comments  on  the  structure  of  a-chymotrypsin 

B.  S.  HARTLEY 

School  of  Biochemistry,  University  of  Cambridge 


I  must  apologize  for  presenting  somewhat  preliminary  results  on  the  struc- 
ture of  a-chymotrypsin,  but  I  feel  I  might  be  able  to  supplement  some  of 
the  information  derived  from  Dr  Sorm's  peptides. 

I  have  been  studying  the  B-chain  of  performic  acid-oxidized  a-chymotrypsin 
which  was  first  isolated  by  Meedom  (Acta.  Chem.  Scand.,  10,  881  (1956)). 
It  contains  about  160  residues — approximately  two-thirds  of  the  enzyme — 
and  has  a  C-terminal  tyrosine  and  an  A^-terminal  Ileu.Val-sequence.  It  is 
especially  interesting  since  the  active  centre  of  chymotrypsin  probably  in- 
volves a  histidine  and  a  unique  serine  residue,  and  both  these  residues  are 
present  in  this  chain. 

Both  tryptic  and  chymotryptic  digests  of  the  B-chain  have  been  studied. 
Some  of  the  more  profitable  results  were  obtained  by  following  the  chymo- 
tryptic digestion  in  a  pH-stat  and  stopping  the  reaction  after  12-15  bonds 
were  spht.  The  fraction  of  the  digest  soluble  in  50%  ethanol  at  —5°,  pH 
3-6,  consisted  almost  entirely  of  16  neutral  and  basic  peptides,  which  were 
purified  by  paper  chromatography  and  electrophoresis.  The  composition 
and  sequence  in  these  peptides,  which  account  for  about  half  of  the  residues 
in  the  B-chain,  is  at  present  being  studied,  but  some  preliminary  results 
may  be  relevant  to  this  discussion. 

The  oxidized  B-chain  contains  two  arginine  residues  which  appear  in  the 
following  three  peptides  after  chymotryptic  digestion  : 

1.  Ala.  Arg.  (Thr,  Ala,  Val).  Leu 

2.  Ala.  Asp.  Thr.  (Asp,  Ala,  Pro).  Arg.   lieu.  (Glug,  Sera,  Ala,  Pro).  Leu 

3.  Ala.  Asp.  (Asp,  Thr,  Ala,    Pro).  Arg.  lieu 

It  seems  probable  that  peptide  3  arises  by  further  chymotryptic  digestion 
of  2.  A  tryptic  digest  of  the  B-chain  yielded  Ala.  Arg  and  Ala.  (Asp2,  Thr, 
Ala,  Pro).  Arg  as  the  only  peptides  containing  arginine,  confirming  that 
peptides  1.  and  2.  above  represent  the  environment  of  the  two  arginine 
residues  in  this  chain. 

A  further  peptide  which  may  be  of  interest  is  Ala.  (Ala,  Asp-NHg) — 
isolated  in  good  yield  from  the  chymotryptic  digest.  This  may  be  another 
example  of  an  asparaginyl  split  by  chymotrypsin,  such  as  was  mentioned 
by  Professor  Desnuelle,  though  one  has  to  be  wary  of  peptides  from  this 
oxidized  protein  because  of  the  unknown  fate  of  the  tryptophan  residues. 


Reaction  of  trypsin  with 
organophosphorus  esters 

T.  VISWANATHA 

Carlsberg  Laboratory,  Copenhagen,  Denmark 


The  relationship  between  protein  structure  and  biological  activity  has  been 
a  matter  of  considerable  discussion  in  recent  years.  Turning  our  attention 
to  specific  cases  of  proteolytic  enzymes,  the  opinion  as  to  the  nature  of  the 
active  site(s)  appears  to  be  sharply  divided.  Thus,  for  example,  in  the  case 
of  a-chymotrypsin,  photo-oxidation  studies  by  Weil  and  co-workers,^  the 
dye-binding  studies  of  Hartley  and  his  associates, ^  the  reaction  with  fluoro- 
dinitrobenzene  by  Whitaker  and  Jandorf,^  and  similar  studies  by  others,* 
suggest  the  association  of  an  imidazolyl  residue  on  the  enzyme  with  the 
active  site.  Evidence  to  the  contrary  have  been  reported  by  Neurath  and  his 
school^  and  by  Cohen  and  his  associates.^  The  ability  of  chymotrypsin  to 
react  with  diwopropyl  fluorophosphonate  (DFP)  has  been  used  in  most  of 
the  above-mentioned  studies  to  throw  some  light  on  the  nature  of  the  active 
site(s). 

Studies  on  the  reaction  of  trypsin  with  DFP  and  diethyl  j3-nitrophenyl 
phosphate  (E600)  carried  out  in  the  Carlsberg  Laboratory  have  led  to  the 
following  results  and  conclusions  : 

(I)  Both  DFP  and  E600  can  react  with  trypsin  in  the  presence  of  high 
concentrations  of  urea  resulting  in  the  incorporation  of  one  mole  of  phos- 
phorus per  mole  of  the  enzyme.  A  nearly  stoicheiometric  yield  of/7-nitrophenol 
(one  mole  per  mole  of  enzyme)  was  obtained  during  the  reaction  with  E600. 
The  unique  feature  of  the  reaction  in  the  presence  of  urea  is  that  the  enzyme 
maintains  its  ability  to  revert  to  the  active  form  on  appropriate  dilution. 
In  aqueous  medium  irreversible  inactivation  of  the  enzyme  accompanies  the 
reaction. 

(II)  The  reaction  of  trypsin  with  DFP  in  aqueous  medium  at  pH  8-0 
results  in  the  incorporation  of  2-0  moles  of  phosphorus  per  mole  of  the 
enzyme.  At  pH  more  alkaline  than  8-0,  more  than  2-0  moles  of  phosphorus 
per  mole  of  the  enzyme  are  introduced  into  the  protein.  While  the  first  2-0 
moles  of  phosphate  introduced  appear  to  be  stable  to  acid  treatment,  the 
additional  phosphate  groups  are  acid-labile  suggesting  their  attachment  to 
amino  groups  on  the  protein. 

The  presence  of  2-0  moles  of  phosphorus  has  been  investigated  further. 
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It  has  been  found  that  the  incorporation  of  the  first  phosphate  on  the  enzyme 
is  a  rapid  reaction  and  leads  to  inactivation.  The  second  phosphate  is  intro- 
duced at  a  much  slower  rate  than  the  first. 

E600  under  identical  conditions  yields  only  1-0  mole  phosphorus  per 
mole  of  the  enzyme.  It  was  noted  that  the  reaction  between  trypsin  and 
E600  occurs  at  a  much  lower  rate  compared  with  that  with  DFP.  This  obser- 
vation might  account  for  the  differences  in  the  phosphorus  incorporation 
data. 

(III)  Blocking  the  amino  groups  of  trypsin  by  acetylation  (73%  of  amino 
groups  blocked  as  shown  by  ninhydrin  determination)  impairs  the  ability 
of  the  enzyme  to  react  with  E600  in  the  presence  of  urea,  whilst  the  reaction 
in  aqueous  medium  can  still  occur. 

(IV)  The  inactive  precursor,  chymotrypsinogen,  could  be  phosphorylated 
by  DFP  either  in  aqueous  medium  or  in  the  presence  of  urea.  A  polyphos- 
phoryl  derivative  capable  of  activation  is  formed. 

(V)  A  soluble,  active  di/50propyl  phosphoryl  trypsin  (with  0-75  moles 
'P'  per  mole  and  81%  activity)  has  been  obtained.  All  the  above-mentioned 
results  suggest  that  an  active  enzyme  is  not  an  essential  pre-requisite  for  the 
reaction  with  DFP  and  E600.  These  results  and  conclusions  thereof  are  in 
apparent  disagreement  with  those  of  Harris  and  Hartley'  and  Neurath  and 
his  associates.^  The  observations  in  the  present  study  have  been  interpreted 
along  the  following  hnes: 

The  reaction  of  trypsin  with  DFP  or  E600  may  be  visualized  to  proceed 
by  two  independent  phosphorylation  mechanisms.  The  first  one  is  perhaps 
catalysed  by  an  imidazole  group  (or  some  other  group  in  the  case  of  E600) 
on  the  protein.  This  reaction  is  fast  and  leads  to  inactivation.  The  second 
phosphorylation,  which  is  a  slow  process,  appears  to  be  mediated  by  an 
amino  group.  A  common  feature  in  both  mechanisms  appears  to  be  a 
migration  of  the  phosphoryl  moiety  to  an  acid-stable  acceptor  group.  Whilst 
the  first  mechanism  ceases  to  operate  in  strong  urea  solution,  the  latter  is 
unaffected  by  such  changes  in  the  medium.  Acetylation  appears  to  eliminate 
the  second  mechanism.  The  second  phosphorylation  reaction  has  no  influ- 
ence on  the  enzymic  activity  if  carried  out  under  conditions  when  the  first 
one  is  not  operative  as  in  the  presence  of  urea. 

With  this  information,  efforts  were  next  concentrated  to  obtain  some  idea 
of  the  nature  of  the  active  site(s)  involved  in  the  reaction. 

The  results  obtained  are  summarized  below: 

(I)  Histidine  or  imidazole  fails  to  catalyse  the  hydrolysis  of  E600  at  pH 
7-6,  whilst  DFP  hydrolysis  is  enhanced  by  the  presence  of  histidine. 

(II)  Hydroxylamine  is  capable  of  enhancing  the  rate  of  DFP  and  E600 
hydrolysis. 

(III)  None  of  the  amino  acids  shows  any  catalytic  activity  towards  E600. 

(IV)  Studies  on  the  pH  optimum  for  the  inactivation  of  trypsin  by  the 
inhibitors  show  that  the  rate  of  inactivation  by  DFP  increases  with  pH  from 
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5-0  to  7-0,  and,  above  pH  7-0,  the  rate  is  independent  of  pH.  On  the  other 
hand,  in  the  case  of  E600  a  sharp  maximum  for  the  reaction  is  noticed  at 
pH  6-90.  The  latter  corresponds  to  the  pH  optimum  for  the  reactivation  of 
E600  inhibited  chymotrypsin  reported  by  Cunningham.^ 

The  difference  in  the  behaviour  of  DFP  and  E600  recorded  above  might 
be  interpreted  as  suggesting  the  involvement  of  different  groups  on  the 
enzyme  for  the  reaction  with  these  organophosphorus  esters.  It  might  also 
reflect  on  the  inadequacy  of  any  one  amino  acid  side  chain  on  the  protein 
molecule  to  account  for  the  catalytic  function  of  the  enzyme.  The  enzymic 
activity  might  be  related  to  the  presence  of  some  special  structure  or  spatial 
configuration  of  groups  in  the  protein  molecule. 

A  detailed  report  of  this  work  will  appear  in  Comptes  rendus  des  Travaux 
du  Laboratoire  Carlsberg,  Série  chimique. 
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Modification  of  pepsin  by  autodigestion 

GERTRUDE  E.  PERLMANN  AND 
MARY  J.  MYCEK* 

The  Rockefeller  Institute  for  Medical  Research,  New  York,  U.S.A. 


In  the  work  on  protein  structure  one  objective  has  been  the  discovery  and 
use  of  enzymes  that  attack  linkages  in  specific  configuration  thus  leaving 
the  protein  substrate  relatively  intact.  Although  the  action  of  such  enzymes 
may  consist  in  as  little  as  the  cleavage  of  one  bond — or  the  removal  of  a 
certain  group  of  the  molecule — considerable  changes  in  some  of  the  pro- 
perties of  the  protein  may  occur.  ^  Thus  it  has  been  possible  to  demonstrate 
that  the  enzymic  removal  of  the  pepsin-phosphorus  does  not  impair  the 
proteolytic  activity  of  the  protein  although  its  electrophoretic  mobility  is 
altered.  In  contrast  to  these  results  it  is  of  interest  that  through  the  action 
of  'pepsin  on  pepsin'  a  modified  form  of  the  protein  can  be  obtained.  Here, 
the  physico-chemical  properties  such  as  the  electrophoretic  behavior  and 
the  molecular  weight  have  not  changed  appreciably  but  the  relative  specific 
enzymic  activity  is  higher  than  that  of  the  parent  substance. 

As  shown  by  Steinhardt,^  pepsin,  if  stored  in  1-0  to  60  m  urea  at  3°C 
for  one  to  fifteen  days,  retains  its  enzymic  activity.  However,  a  marked  loss 
of  proteolysis  occurs,  if  pepsin  is  exposed  to  8-0  M  urea  at  temperatures 
above  20°C.^  Thus,  the  activity  of  a  0-5  per  cent  pepsin  solution  kept  in 
urea-acetate  of  pH  5-3  and  0-07  ionic  strength  for  24  hours  at  25°C,  is 
reduced  to  40  per  cent  of  its  initial  value,  whereas  at  37°C  almost  complete 
inactivation  takes  place.  The  rate  of  inactivaiion  is  a  function  of  the  urea 
concentration,  the  pH  and  the  ionic  composition  of  the  solvent.  In  addition 
to  the  loss  of  enzymic  activity  the  absorption  maximum  of  pepsin  in  the 
ultraviolet  is  shifted  from  2780  Â  to  the  shorter  wave  length  of  2760  Â  after 
treatment  with  urea.  It  emerges  from  these  results  that  the  action  of  the 
urea  on  pepsin  consists  in  the  rupture  of  hydrogen-bonded  phenolic  hydroxyl 
groups  of  the  tyrosine  residues.  The  protein  unfolds  slightly  and  a  number 
of  peptide  bonds  hitherto  not  accessible  to  enzymic  hydrolysis  are  now 
exposed  and  rapidly  hydrolyzed.  The  view  that  an  enzyme-catalyzed  auto- 
lysis is  the  underlying  phenomenon  is  further  supported  by  the  fact  that 
exposure  to  urea  produces  an  increase  in  the  solubility  of  pepsin  in  hot 
10  per  cent  trichloroacetic  acid.  The  formation  of  non-protein  material  is 
most  marked  in  the  pH  range  of  4-0  to  5-5. 

*  Public  Health  Service  Postdoctoral  Fellow  of  the  National  Cancer  Institute,  National 
Institutes  of  Health. 
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Since  at  any  given  stage  of  the  urea  inactivation  not  more  than  70  per 
cent  of  the  trichloroacetic  acid  soluble  material  is  dialysable,  samples  of 
pepsin  treated  with  8-0  m  urea  at  37°C  for  6,  12,  18  and  24  hours,  respectively, 
were  dialyzed  and  analyzed  electrophoretically.  As  indicated  in  Fig,  1,  the 

Time  of  Relative  specific  Zlectpophopetic 

exposure  to  urea  proteolytic  activity  composition 

in  hours  pep  unit  weight 

0  100  100  % 


88%  P 

6  51  1%  p. 

h.h    ->-.  5%  pj 

75  %P 
12  26  14%  p, 

11  %  p, 

67  %P 
18  15  17%  p, 

16%  Pj 

63  %P 
2-*  e  20  %  p, 

17%  pj 

Fig.  1.  Tracings  of  electrophoretic  patterns  of  0  7%  pepsin  solutions  in  sodium  acetate 
buffer  of  pH  4-6  and  0-1  ionic  strength.  Electrophoresis  was  carried  out  for  5400  sec.  at 
a  potential  gradient  of  6-4  volts  per  cm. 

loss  of  activity  is  accompanied  by  the  appearance  of  two  new  components, 
Pi  and  p2,  formed  at  the  expense  of  the  original  protein,  P.  The  relative 
specific  activity  of  the  mixture  which  has  been  exposed  to  urea  for  24  hours 
is  only  8  per  cent  of  the  original.  On  removal  of  the  low-molecular  weight 
material  by  dialysis,  the  relative  enzymic  activity  of  the  non-dialyzable  frac- 
tion, however,  is  similar  to  that  of  the  original  protein.  Hence,  the  question 
arises  whether  the  residual  activity  is  due  to  the  presence,  in  the  reaction 
mixture,  of  some  unchanged  pepsin. 

To  test  this  hypothesis  the  electrophoretic  components  of  pepsin  after 
treatment  with  urea  at  37°C  for  24  hours,  were  separated  by  zone  electro- 
phoresis on  a  Geon  resin.  As  shown  by  Fig.  2,  the  distribution  of  the 
electrophoretic  components  is  similar  to  the  pattern  obtained  in  free  elec- 
trophoresis (Fig.  1).  The  relative  concentration  of  Pi  and  P2,  however,  had 
decreased  considerably  owing  to  the  fact  that  this  sample  had  been  pre- 
dialyzed  against  a  hydrochloric  acid-glycine  buffer  of  pH  2-5  and  0-1  ionic 
strength  prior  to  the  dialysis  against  the  acetate  buffer  used  in  the  electro- 
phoresis experiment.  As  indicated  in  Fig.  2,  only  the  major  component  of 
the  mixture  with  a  mobility  similar  to  that  of  pepsin  had  proteolytic  activity. 

Although  the  sedimentation  constant  of  the  active  component,  ^'20  = 
2•8lXlO-^^  is  only  slightly  lower  than  that  of  pepsin,  S2o  =  2-9qX\0-^^,* 
the  following  differences  have  been  established:  The  relative  specific  activity 
per  unit  nitrogen  of  the  modified  protein  is  40  to  50  per  cent  higher  than 

*  The  authors  are  indebted  to  Dr  R.  Trautman  for  the  ultracentrifuge  measurements. 


10]  AUTODIGESTION  OF  PEPSIN  181 

that  of  the  parent  substance.  If  the  activity  is  expressed  per  unit  tyrosine, 
the  increase  is  80  to  100  per  cent.  The  hydrolysis  of  hemoglobin^  and 
of  the  synthetic  substrate  acetyl-L-phenylalanyl-diiodotyrosine^  are  affected 
to  the  same  extent.  Preliminary  amino  acid  analyses,  kindly  carried  out  by 
Dr  D.  Spackman,  revealed  differences  not  only  in  the  tyrosine  content  but 
also  in  that  of  several  other  amino  acids. 
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Fig.  2.  Zone  electrophoresis  on  Geon  resin  of  pepsin  treated  with  8  0m  urea  at  37°C 
for  24  hours.  Electrophoresis  was  carried  out  in  acetate  buffer,  pH  46  and  O-l  ionic 
strength  after  predialysis  in  glycine-HCl,  pH  2-5,  01  ionic  strength. 


The  present  observations  consist  of  a  few  experiments  only,  and  it  is  not 
possible  to  state  whether  the  urea  modified  pepsin  represents  a  mixture  of 
molecules  which  differ  from  each  other  by  small  changes  in  their  amino 
acid  composition  brought  about  by  autodigestion  or  whether  the  original 
pepsin  molecules  have  been  broken  down  to  smaller  units  which  reaggregate 
to  form  a  complex  of  a  molecular  weight  of  30,000  to  35,000. 

From  the  results  presented  here  it  can  be  concluded  that  the  apparent 
loss  of  the  enzymic  activity  of  pepsin  after  treatment  with  urea  is  caused 
by  an  enhanced  autodigestion.  At  any  given  stage,  however,  during  the 
inactivation  enzymically  active  modifications  of  the  parent  enzyme  exist 
which  differ  in  some  of  their  properties.  Although  an  enhancement  of 
activity  has  been  observed  it  has  not  yet  been  possible  to  change  the  speci- 
ficity of  the  enzyme. 
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The  successful  elucidation  of  the  structure  of  insulins  from  several  species, 
and  of  a  number  of  smaller  polypeptide  hormones  and  antibiotics  has 
stimulated  efforts  to  understand  the  relationship  between  protein  structure 
and  biological  activity.^  Enzymes  are  particularly  suited  for  such  investiga- 
tion inasmuch  as  their  activity  and  specificity  can  be  studied  in  vitro.  The 
structure  of  several  enzymes  is  now^  being  investigated  in  a  number  of 
laboratories,  and  it  is  likely  that  these  efforts  will  soon  be  successfully  com- 
pleted in  the  case  of  a  few  of  the  smaller  molecules.  However,  a  knowledge 
of  amino  acid  sequence  will  not  by  itself  solve  the  problem  of  correlating 
structure  and  function.  Much  more  information  will  be  necessary  to  achieve 
real  understanding  of  the  catalytic  properties  of  enzymes.  First  of  all,  the 
region  or  regions  of  the  protein  which  are  involved  in  specific  interaction 
with  the  substrates  must  be  identified  and  a  satisfactory  explanation  of  the 
substrate  specificity  must  be  at  hand.  Secondly,  the  kinetic  behavior  and 
overall  reaction  mechanism  must  be  expHcable  in  terms  of  the  enzyme  struc- 
ture. It  is  obviously  a  difficult  task  at  present  to  find  enzymes  of  suitable 
purity  and  availability  which  lend  themselves  to  all  of  the  necessary  types 
of  study  and  which  are  small  enough  to  permit  attacking  the  problem  of 
structure  by  presently  available  methods. 

In  our  laboratory,  we  have  chosen  a  proteolytic  enzyme,  papain,  for  in- 
tensive study  because  it  possesses  a  number  of  desirable  features  ;  however, 
it  has  some  disadvantages  as  well.  Among  the  many  advantages  which 
papain  possesses  is  the  fact  that  the  enzyme  can  be  readily  isolated  in  reason- 
able quantity  from  the  commercially  available  dried  papaya  latex.  ^  As  with 

*  The  work  reported  has  been  supported  in  large  part  by  grants  from  the  National 
Institutes  of  Health,  United  States  Public  Health  Service,  The  American  Cancer  Society 
and  the  Rockefeller  Foundation. 
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Other  proteolytic  enzymes,  synthetic  substrates  of  known  structure  can  be 
readily  prepared.  Because  of  the  good  stability  of  the  enzyme  under  a  variety 
of  conditions,  kinetic  studies  with  substrates  of  different  structure  can  be 
performed  and  such  studies  have  yielded  valuable  information  concerning 
the  nature  of  the  active  site  and  a  possible  mechanism  of  action  of  the 
enzyme.^-* 

One  of  the  most  important  features  of  papain  is  the  finding  that  a  single 
sulfhydryl  group  is  essential  for  activity  and  that  this  group  is  intimately 
concerned  in  the  enzymic  mechanism.^-^  This  presents  the  opportunity  of 
labeling  the  specific  part  of  the  peptide  chain  which  is  directly  involved  in 
the  interaction  with  substrates. 

This  brief  consideration  of  some  of  the  desirable  properties  of  the  enzyme 
should  not  serve  to  obscure  the  fact  that  a  protein  containing  approxi- 
mately 1 80  residues  poses  difficult  problems  for  study.  Although  satisfactory 
methods  appear  to  be  available  for  determining  the  amino  acid  sequence, 
it  is  obvious  to  all  who  are  engaged  in  such  studies  that  the  technical  prob- 
lems are  formidable  indeed  and  that  the  labor  involved  is  considerable. 

In  this  presentation  we  shall  discuss  some  of  the  studies  on  crystalline 
papain  which  indicate  the  approaches  which  are  being  used  in  our  labora- 
tory. Major  emphasis  will  be  given  to  studies  of  the  overall  structure  and  of 
attempts  to  determine  the  nature  of  the  active  region  of  the  molecule  by 
enzymic  degradation  of  the  molecule,  by  labeling  the  active  group,  and  by 
kinetic  analysis. 

GENERAL    PROPERTIES    AND    HOMOGENEITY 

Papain  has  a  molecular  weight  of  about  20,500  as  estimated  by  both  physical 
and  chemical  methods.  (Table  1.)  The  molecule  is  slightly  basic  with  an  iso- 

Table  1 
SOME   PROPERTIES    OF   PAPAIN 


Molecular  weight  (from  sedimentation-diffusion) 

20,700 

„             „      (from  amino  acid  content) 

20,300 

„             „      (from  crystalline  mercury  complex) 

1/2x41,400 

„             „      (from  maximum  mercury  binding) 

19,900 

Isoelectric  point  (0  1  ionic  strength) 

8-75 

S.20,W 

2-42  S 

Diffusion  constant  (x  10'  sq.  cm.  per  second) 

10-3 

Free  a-amino  groups  per  mole 

1  Isoleucine 

electric  point  at  pH  8-75  in  buffers  of  0-1  ionic  strength.  The  crystalline 
enzyme,  or  better,  its  crystalhne  mercury  derivative,  is  essentially  homo- 
geneous as  judged  by  sedimentation  studies  in  the  ultracentrifuge/  by  elec- 
trophoretic  analysis,^  and  by  end-group  determinations.'  Immunochemical 
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investigations^  show  that  the  crystalline  enzyme  is  a  good  antigen  and  the 
system  of  protein  and  rabbit  antiserum  shows  all  of  the  behavior  charac- 
teristic of  a  single  antigen  and  a  single  antibody.  Moreover,  since  the  anti- 
serum strongly  inhibits  the  enzymic  activity,  it  is  evident  that  the  enzyme 
itself  is  the  antigen.  In  essence,  all  of  these  studies  are  consonant  with  the 
view  that  crystalline  papain  is  molecularly  homogeneous.  Some  of  the  over- 
all properties  of  the  molecule  are  summarized  in  Table  1. 

THE    SULFHYDRYL    GROUP 
AND    SULFUR    DISTRIBUTION 

Papain  is  inhibited  by  all  of  the  agents  which  react  with  thiol  groups,  e.g., 
heavy  metal  ions,  organic  mercurials,  alkylating  agents,  and  oxidizing  agents. 
The  presence  of  an  essential  thiol  group  thus  presents  a  number  of  disad- 
vantages since  the  enzyme  is  readily  inactivated  by  traces  of  metal  ions  and 
by  oxidation  under  aerobic  conditions.  Full  enzymic  activity  is  obtained 
only  when  a  chelating  agent  and  a  reducing  agent  are  present. ^  Dimercapto- 
propanol  (BAL)  serves  in  both  capacities  and  is  a  convenient  compound 
to  use  in  kinetic  studies  of  the  enzyme.*-^ 

The  strong  affinity  of  the  enzyme  for  heavy  metal  ions  has  been  useful 
in  permitting  the  isolation  of  crystalline  mercuripapain  which  is  inactive 
and  is  thus  not  liable  to  autodigestion. ^  Moreover,  mercuripapain  is  more 
homogeneous  and  more  stable  than  crystalline  papain  itself.  Although  crys- 
talline mercuripapain  contains  one  atom  of  mercury  for  each  two  moles 
of  protein,  it  has  been  possible  to  demonstrate  that  a  molecule  of  reduced 
papain  can  bind  maximally  one  mercuric  ion.^^^ 

The  spectrophotometric  method  of  Boyer^''  has  been  employed  to  esti- 
mate the  affinity  of  reduced  papain  for;j-chloromercuribenzoate  (PCMB),^'^^ 
Fig.  1  shows  the  results  of  such  studies  with  different  preparations  of  the 
crystaHine  enzyme.  It  is  evident  that  in  each  case  less  than  one  thiol  group 
per  mole  of  papain  can  react  with  the  mercurial.  In  addition,  the  proteo- 
lytic coefficient  (Q)  for  the  hydrolysis  of  benzoyl-L-argininamide,  varies 
with  the  amount  of  active  SH.  It  should  be  noted  that  Balls  and  Lineweaver^^ 
originally  observed  that  different  preparations  of  the  crystaUine  enzyme  vary 
in  absolute  activity. 

The  finding  that  less  than  one  mole  of  reactive  SH  is  present  in  papain 
does  not  indicate  any  contamination  of  the  enzyme  with  other  proteins. 
On  the  contrary,  crystalline  preparations  of  the  enzyme  with  different  pro- 
teolytic activity  manifest  the  same  physical  properties,  are  homogeneous  and 
possess  the  same  amino  acid  composition.  The  explanation  for  the  varia- 
tion in  activity  and  reactive  thiol  resides  in  the  observations  that  preparations 
of  the  enzyme  or  its  mercury  derivative  undergo  an  irreversible  inactivation 
on  storage.  Indeed,  this  change  appears  to  occur  in  the  crude  latex  as  well, 
and  freshly  prepared  samples  of  crystalline  enzyme  manifest  this  variability.^ 
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The  linear  relationship  between  proteolytic  activity  and  reactive  thiol 
shown  in  Fig.  1  provides  strong  evidence  that  the  thiol  group  is  directly 
involved  in  the  catalytic  activity  of  the  enzyme.  It  may  be  noted  that  the 
thiol  group  of  papain  reacts  very  rapidly  with  PCMB.  Indeed,  the  reaction 
is  essentially  complete  in  less  than  two  minutes. 


SH~MOLES/MOLE   PROTEIN 


Fig.  1 .  Proteolytic  coefficient  (Cj)  for  hydrolysis  of  benzoyl-L-argininamide  as  a  func- 
tion of  sulfhydryl  reactive  with  /7-chloromercuribenzoate  for  different  preparations  of 
column-reduced  crystalline  papain.  (Finkle  and  Smith,  unpublished.) 

Less  extensive  studies  of  the  reaction  of  iodoacetamide  with  the  enzyme 
also  demonstrate  that  less  than  one  equivalent  of  this  substance  is  bound 
by  the  enzyme.^  Moreover,  this  reaction  produces  complete  and  irreversible 
inhibition  of  the  enzyme.  Reaction  of  the  alkylating  agent  with  a  cysteine 
residue  should  produce  a  thiol  ether  derivative  (Fig.  2).  Identification  of  this 


NoOH 
■SH-HICHgCONHg >- 


Nol 


-S-CH2CONH2 


HOOC-CHCH-S-CH,COOH 
(I)        0 


NH- 


Performic,  LgQ    u-M 
acid         (        3       HCl 


*-  HOOC  — CHCHoSOtH 


Fig.  2.  Reaction  of  iodoacetamide  with  a  sulfhydryl  group  and  subsequent  fate  after 
oxidation  and  hydrolysis  to  produce  ^-carboxymethylcysteine  sulfone  (I). 
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derivative  has  been  accomplished  by  the  following  procedure.^  The  reduced 
enzyme  was  allowed  to  react  with  one  equivalent  of  iodoacetamide.  The 
protein  derivative  was  then  exhaustively  dialyzed  and  the  preparation  was 
oxidized  with  performic  acid.  An  aliquot  was  hydrolyzed  with  6n-HC1  under 
the  usual  conditions  and  worked  up  in  the  conventional  manner.  Chromato- 
graphy of  a  sample  of  the  hydrolysate  showed  the  presence  of  carboxy- 
methylcysteine  sulfone  which  was  identical  in  behavior  with  that  of  an 
authentic,  synthetic  sample  of  this  compound.  Thus,  there  is  no  question 
that  the  reactive  thiol  group  is  present  on  a  residue  of  cysteine. 

The  aforementioned  studies  with  PCMB,  iodoacetamide  and  with  mercury^ 
are  in  accord  with  the  view  that  there  is  only  one  reactive  thiol  group  in 
the  enzyme.  However,  it  is  essential  to  ascertain  the  total  number  of  such 
groups  in  the  protein.  Preparations  of  papain,  treated  by  various  methods, 
react  with  more  than  one  mole  of  PCMB.  Under  certain  conditions,  it  has 
been  found  that  the  enzyme  will  react  with  6  moles  of  the  mercurial,  as 
judged  by  mercury  analyses,*-^  These  results  indicate  that  6  thiol  groups 
are  present  in  the  molecule.  Inasmuch  as  hydrolysates  of  oxidized  papain 
yield  6  moles  of  cysteic  acid,^^  the  agreement  between  these  results  suggests 
that  cystine  itself,  and  hence  that  disulfide  bridges,  are  absent  from  the 
papain  molecule. 

It  has  been  reported^^'^^  that  methionine  is  absent  from  papain.  Never- 
theless, elementary  analysis  reveals  that  the  molecule  contains  8  atoms  of 
sulfur^-^  and  the  presence  of  only  6  cysteine  residues  leaves  two  atoms  of 
sulfur  to  be  explained.  We  have  been  able  to  exclude  the  presence  of  inor- 
ganic sulfate,  acid-labile  sulfate  esters,  thiolhistidine,  or  other  known  amino 
acids  containing  sulfur.^  Thompson^^  could  not  detect  lanthionine  in  hydro- 
lysates of  papain.  It  is  our  hope  that  the  present  systematic  investigation 
of  the  structure  will  provide  eventually  an  understanding  of  the  nature  of 
the  two  missing  sulfur  atoms. 

STUDIES    OF    STRUCTURE 

Amino  terminal  groups.  With  the  dinitrophenyl  method  of  Sanger,^'  Thomp- 
son'^ demonstrated  that  mercuripapain  contains  only  one  free  a-amino  group, 
that  of  isoleucine.  From  partial  hydrolysates  of  the  dinitrophenyl  protein, 
he  isolated  the  unique  tripeptide,  DNP-isoleucylprolylglutamic  acid.  These 
findings  indicate  that  the  papain  molecule  consists  of  only  a  single  peptide 
chain.  In  addition,  Thompson  isolated  8  moles  of  e-DNP-lysine  which  is 
in  accord  with  the  number  of  lysine  residues  found  by  amino  acid  analysis.^* 

Peptides  containing  cysteic  acid.  Inasmuch  as  at  least  one  cysteine  residue  is 
of  primary  importance  in  the  catalytic  activity  of  papain,  a  study  was  made 
of  the  strongly  acidic  peptides  which  could  be  isolated  from  partial  acid 
hydrolysates  of  papain  which  had  been  oxidized  with  performic  acid.^^ 
Hydrolysis  of  the  oxidized  protein  was  performed  in  12N-HC1  at  .37°  for 
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7  days.  The  hydrolysate  was  worked  up  in  the  usual  manner  and  subjected 
to  electrophoresis  on  paper  in  acetic  acid  at  pH  2-3  or  passed  through 
Dowex-50x4  in  the  hydrogen  cycle.  The  strongly  acidic  fraction  was  ob- 
tained by  both  methods.  The  peptides  containing  cysteic  acid  could  then 
be  separated  by  two-dimensional  paper  chromatography.  The  peptides  iso- 
lated by  this  procedure,  whose  structure  has  been  completely  or  partly 
elucidated,  are  listed  in  Table  2.  Obviously,  these  few  peptides  do  not 
account  for  all  of  the  sequences  involving  cysteic  acid  which  must  be  pre- 
sent in  the  oxidized  protein,  probably  because  neutral  peptides,  i.e.,  those 
containing  cysteic  acid  and  a  basic  residue,  would  not  be  isolated  by  the 
methods  employed. 

It  is  evident  that  the  fragmentary  data  of  Table  2  give  little  information 
by  themselves  and  no  clue  is  obtained  as  to  which  one  of  these  sequences, 
if  any,  may  be  concerned  in  the  enzymic  activity. 

Amino  acid  composition.  The  composition  of  papain  is  given  in  Table  3.^* 
Estimation  of  most  of  the  amino  acids  was  performed  by  chromatography 

Table  2 

CYSTEIC   ACID    PEPTIDES    DEFINITELY 

IDENTIFIED    FROM    PARTIAL   HYDROLYSATES 

OF    OXIDIZED    PAPAIN" 


Asp.  CySOgH 
Ser.  CySOgH 
Val.  CySOgH 
Gly  (Pro,  CySOgH) 


CySOgH.  Asp 

CySOgH.  Gly 

CySOg.  Gly.  Asp 

Ser.  CySOsH.  CySOgH  (?) 


Table  3 
AMINO    ACID    COMPOSITION    OF    PAPAIN".^* 


Amino  acid 

Residues 

Amino  acid 

Residues 

Asp 

17 

Leu 

9 

Thr 

7 

Tyr 

17 

Ser 

11 

Phe 

4 

Glu 

17 

His 

1 

Pro 

9 

Lys 

8 

Gly 

23 

Arg 

9 

Ala 

13 

Tryp 

5 

Val 

15 

1/2  Cys 

6 

lieu 

9 

Amide 

(19) 

Total 

180 
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on  ion-exchange  columns  by  the  procedures  of  Moore  and  Stein. ^®  In  our 
studies,  ^^  we  used  different  times  of  hydrolysis  in  6n-HC1  in  order  to  cor- 
rect for  destruction  of  labile  amino  acids  and  to  produce  complete  libera- 
tion of  slowly  released  residues.  Tryptophan  was  estimated  by  a  colorimetric 
method^^  and  cysteine  was  estimated  chromatographically  after  hydrolysis 
of  the  oxidized  protein. ^^ 

The  composition  of  papain  shows  several  noteworthy  features:  the  ab- 
sence of  methionine,  the  presence  of  only  four  residues  of  phenylalanine 
and  one  of  histidine,  and  the  high  content  of  glycine  and  tyrosine. 

Tryptic  digestion  of  oxidized  papain.  The  most  useful  approach  to  the  study 
of  amino  acid  sequence  in  large  polypeptides  and  proteins  has  been  to  de- 
grade the  molecule  first  into  a  relatively  small  number  of  large  peptides. 
Experienced'-^  has  shown  that  this  type  of  degradation  can  best  be  accom- 
plished with  proteolytic  enzymes.  Selection  of  the  proteolytic  enzyme  de- 
pends upon  the  nature  of  the  protein  being  studied  and  the  specificity  of 
the  enzyme.  With  papain,  trypsin  appeared  to  be  the  enzyme  of  choice  for 
first  study.  Papain  contains  9  arginine  and  8  lysine  residues,  thus  providing 
17  sites  at  which  trypsin  can  act  and  this  would  yield,  theoretically,  18  pep- 
tides. The  amino  acid  composition  of  papain  is  such  that  other  proteolytic 
enzymes  could  be  expected  to  yield  a  larger  number  of  fragments. 

Papain  itself,  mercuripapain,  and  papain  combined  with  6  moles  of  PCMB 
are  relatively  resistant  to  the  action  of  trypsin. ^^  Furthermore,  each  of  these 
materials  is  potentially  capable  of  being  partially  activated  during  tryptic 
digestion,  thus  permitting  autolysis  and  inactivation  of  the  trypsin.  Auto- 
lysis is  particularly  undesirable  because  of  the  broad  specificity  of  papain. 
The  most  satisfactory  papain  derivative  for  degradation  by  trypsin  is  the 
performic  acid-oxidized  protein.  This  derivative  has  no  proteolytic  activity 
and  is  digested  by  trypsin  at  pH  7-5.  However,  oxidized  papain  is  insoluble 
at  this  pH  and  digestion  proceeds  slowly  for  several  days.  Oxidation  of 
papain  results  in  modification  of  the  5  tryptophan  residues  in  the  protein 
as  well  as  in  oxidation  of  cysteine  to  cysteic  acid  residues.^^ 

The  procedure  has  been  to  digest  oxidized  papain  at  pH  7-5  with  crystal- 
line trypsin  until  the  ninhydrin  color  of  an  aliquot  is  constant.  In  general, 
fresh  trypsin  has  been  added  every  10  to  20  hours.  In  no  case  did  the  trypsin 
content  of  the  digestion  mixture  exceed  2  per  cent  of  that  of  the  oxidized 
papain.  The  resulting  digest  is  dried  from  the  frozen  state  and  the  dry 
material  is  extracted  with  a  0-2  n  formate  buffer  at  pH  2-1.  The  soluble 
peptides  in  the  extract  are  resolved  by  chromatography.  The  large  amount 
of  insoluble  residue  must  be  fractionated  by  other  means. 

Chromatography  of  the  soluble  peptides  has  been  accomplished  on  a 
150  cm.  column  of  Dowex  50  x  2  with  a  gradient  elution  system.  The  elution 
pattern  and  the  pH  and  ionic  strength  gradients  are  shown  in  Fig.  3.^^  It 
should  be  noted  that  this  particular  chromatogram  was  obtained  on  a 
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column  7  mm.  in  diameter  and  represents  the  first  successful  attempt  to 
resolve  the  tryptic  digest  of  oxidized  papain.  Subsequent  chromatography 
was  performed  on  a  column  30  mm.  in  diameter  having  17  times  the  capacity 
of  the  original  column.  In  general,  the  transition  from  the  small  to  the  large 
column  has  resulted  in  only  minor  changes  in  the  elution  pattern. ^^ 
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Fig.  3.  Chromatographic  separation  of  soluble  peptides  found  in  a  tryptic  digest  of 
oxidized  papain.  The  ninhydrin-reactive  components  were  resolved  on  a  150  cm.  column 
of  Dowex-50x2  with  buffers  varying  in  ionic  strength  and  pH.  The  solid  bars  show  the 
fractions  which  were  pooled.  (Kimmel  and  Smith,  unpublished.)  O.D.  is  optical  density; 
upper  part  of  figure  :  full  lines,  pH  ;  broken  lines,  ionic  strength. 


Amino  acid  analyses  have  been  performed  on  all  of  the  fractions  numbered 
in  this  figure.  Most  of  the  fractions  up  to  Fraction  14  appear  to  contain 
single  peptides  or  mixtures  of  peptides  in  which  each  peptide  contains  only 
one  basic  amino  acid.  Fractions  14  through  18  contain  mixtures  of  large 
complex  peptides  with  more  than  one  basic  amino  acid.  Presumably,  these 
result  from  incomplete  tryptic  digestion,  although  it  is  possible  that  the 
peptide  bonds  involving  the  linked  carboxyl  groups  of  lysine  and  arginine 
in  these  fragments  are  resistant  to  trypsin,  as  in  a  lysylproline  sequence. 

Fractions  1,  2  and  3  in  this  chromatogram  emerge  as  sharp  symmetrical 
peaks  and  the  stoichiometry  of  the  amino  acid  analyses  suggested  that  these 
fractions  were  composed  of  nearly  pure  peptides.  Fraction  1  is  particularly 
interesting  since  analysis  showed  27  amino  acid  residues  including  2  resi- 
dues of  cysteic  acid.  No  basic  amino  acid  was  found  in  this  fraction  and, 
therefore,  it  was  assumed  that  fraction  1  represents  the  C-terminal  portion 
of  papain. ^^  Fraction  2  is  a  smaller  peptide  with  a  high  aspartic  acid  content. 

The  elution  pattern  shown  in  Fig.  3  was  obtained  on  5  different  occasions. 
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Subsequent  patterns  (Fig.  4)  have  shown  only  traces  of  ninhydrin-positive 
material  in  the  regions  where  Fractions  1,  2,  and  3  were  expected  and  ana- 
lyses of  the  small  peaks  in  these  areas  have  revealed  only  small  amounts  of 
material  with  free  cysteic  acid  predominating  in  the  region  of  Fraction  1 
and  free  aspartic  acid  in  the  region  of  Fraction  2.^°  In  later  portions  of  the 
chromatogram  reproducibility  has  been  much  better.  Fractions  5  through  16 
have  appeared  constantly  in  their  expected  positions.  The  complexity  of  the 
fractions  is,  however,  quite  variable.  Fraction  4  (Fig.  3),  for  example,  appears 
to  be  a  mixture  of  small  amounts  of  a  number  of  peptides  or  amino  acids, 
and  the  shape  of  the  curve  shown  here  is  certainly  suggestive  of  this.  Frac- 
tions 10  plus  1 1  (Fig.  4)  are  composed  of  at  least  three  peptides,  and  Fractions 
12  and  13  overlap  with  the  same  large  complex  peptide.  Fractions  5,  6,  8 
and  9  appear  to  be  constant  in  character. 


■     I     I i_ 
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Fig.  4.  Chromatographic  separation  of  peptides  in  a  tryptic  digest  of  oxidized  papain. 
Except  for  the  absence  of  Fractions  1  to  3,  the  results  were  similar  to  those  shown  in 
Fig.  3.  (From  Kimmel  and  Smith,  unpublished.) 


Table  4  shows  the  composition  of  the  peptides  that  have  been  isolated 
and  shown  to  be  pure.^"  Unknown  sequences  are  enclosed  in  parentheses. 
It  is  interesting  that  two  of  the  sequences  shown  have  been  encountered 
in  other  proteins  and  polypeptides.  The  sequence  serylarginine  has  been 
found  in  chymotrypsin^^  and  in  ribonuclease.^^  The  very  unusual  sequence 
ala.ala.ala.lys.  is  also  present  in  ribonuclease.^^ 

It  is  obvious  that  the  peptides  shown  in  Table  4  represent  less  than  one 
third  of  the  papain  molecule.  The  yield  of  these  peptides  is  not  high,  ranging 
from  20  to  40  per  cent,  but  significant  enough  to  indicate  that  they  repre- 
sent authentic  fragments  from  oxidized  papain.  Several  of  the  fractions 
shown  in  Figs.  3  and  4  are  known  to  be  mixtures  of  peptides,  and  attempts 
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Table  4 

SOME   PURE   PEPTIDES    ISOLATED    FROM    A   TRYPTIC    DIGEST 
OF    OXIDIZED    PAPAIN 


Number  of 

Fraction 

Amino  acid  composition 

residues 

1 

KCyS03H)2,  Aspa,  Thr.  Serg,  Glug,  Pro,  Glyj,  Ala, 

Vala,  lieu,  LeUg,  TyrJ 

27 

5 

Gly.Gly.Ileu.Phe.Val.  Gly.Pro.CySOgH. 

Asp.Gly.Lys 

11 

6 

(Asp,  Sera,  Glug,  Pro,  Gly,  Alag,  Valo,  Heu,  Leu)  Lys 

14 

8a 

(Asp,  Thra,  Ser,  GIu,  Glyg,  Alag,  Vala)  Lys 

13 

8b 

Gly.  Ala.  Val.  Thr.  Pro.  Val.  Lys 

7 

10a 

Ala.Ala.Ala.Lys 

4 

12 

Ser.Arg 

2 

12a 

[(CyS03H)2,  Ser,  Tyra,  Argg] 

7 

13 

Ileu.Lys 

2 

are  being  made  to  resolve  these  mixtures  by  other  methods.  For  example. 
Fraction  10a  has  been  isolated  from  Fraction  10  plus  11  by  ionophoresis 
on  paper  and  two  peptides  (8a  and  %b)  have  been  separated  by  ionophoresis 
from  Fraction  8, 

The  discussion,  thus  far,  has  been  concerned  with  the  acid-soluble  pep- 
tides resulting  from  tryptic  digestion  of  oxidized  papain.  The  insoluble  resi- 
due, which  is  more  difficult  to  fractionate  by  chromatography,  has  not  been 
completely  resolved.  This  crude  fraction  is  of  great  interest  since  it  appears 
to  contain  the  sulfhydryl  group  which  reacts  with  iodoacetamide. 

A  sample  of  papain  has  been  treated  with  iodoacetamide,  dialyzed  free 
of  unreacted  reagent,  and  oxidized  with  performic  acid  as  before.  The  oxi- 
dized protein  was  then  digested  with  trypsin  and  chromatographed  with  the 
eluting  system  shown  above.  The  chromatogram  obtained  resembled  the 
one  shown  in  Fig.  4.  Each  fraction  from  the  soluble  portion  of  the  digest 
was  hydrolyzed  with  acid  and  examined  for  the  presence  of  carboxymethyl- 
cysteinesulfone.  None  of  these  fractions,  pure  or  impure,  known  to  contain 
cysteic  acid  contains  this  derivative.  However,  the  insoluble  fraction  con- 
tained both  cysteic  acid  and  the  sulfone.  These  preliminary  findings  pro- 
vide strong  evidence  that  iodoacetamide  does  not  react  in  a  random  manner 
with  all  the  sulfhydryl  groups  of  papain.  On  the  contrary,  the  results  indi- 
cate that  only  a  single  sulfhydryl  group  in  the  papain  molecule  reacts  with 
the  alkylating  agent.  Attempts  to  isolate  and  identify  the  peptide  contain- 
ing the  carboxymethylcysteinesulfone  have,  so  far,  not  been  successful.  The 
insoluble  residue  appears  to  contain  several  components  and  the  limited 
solubility  of  these  peptides  in  acid  solution  presents  a  particularly  difficult 
problem  of  resolution. 
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In  concluding  the  present  phase  of  this  presentation,  it  may  be  noted 
that  the  problem  of  dealing  with  the  tryptic  peptides  from  a  protein  as 
large  as  papain  is  far  from  simple.  Chromatography  by  itself  has  given  us 
very  few  pure  peptides  from  the  tryptic  digest  of  oxidized  papain.  In  fact, 
we  have  been  badly  misled  on  some  occasions  in  that  good  stoichiometry 
has  been  obtained  on  amino  acid  analysis,  only  to  find  that  several  amino 
end-groups  were  present  as  shown  by  the  dinitrophenyl  method  or  that 
several  components  could  be  identified  by  ionophoresis  on  paper. 

DEGRADATION    OF    MERCURIPAPAIN 
BY    LEUCINE    AMINOPEPTIDASE 

The  isolation  of  leucine  aminopeptidase  in  essentially  homogeneous  form 
and  demonstrably  free  of  other  peptidases  and  of  proteinases^*  has  per- 
mitted investigations  of  the  action  of  this  exopeptidase  on  polypeptides  and 
proteins.  In  brief,  we  have  been  able  to  show  that  the  aminopeptidase  is 
restricted  in  its  hydrolytic  action  to  peptide  bonds  adjacent  to  a  free  a-amino 
group,  regardless  of  the  size  of  the  substrate.  This  has  been  demonstrated 
not  only  with  small,  synthetic,  compounds^^  but  also  with  the  oxidized  A 
and  B  chains  of  insulin,  with  zinc-free  insulin,  with  oxidized  ribonuclease 
and  a  number  of  other  susceptible  proteins.  2^"  ^^  In  no  case,  was  it  possible 
to  obtain  evidence  of  a  hydrolytic  action  at  any  locus  other  than  a  peptide 
bond  at  the  amino-terminal  end  of  the  substrate  molecule.  As  an  illustra- 
tion of  these  findings,  it  may  be  noted  that  the  aminopeptidase  has  been 
successfully  used  to  determine  the  sequence  of  the  first  six  residues  at  the 
iV-terminal  end  of  the  oxidized  B-chain  of  insulin.  ^^ 

The  known  specificity  of  the  aminopeptidase  permits  use  of  the  enzyme, 
not  only  for  sequence  studies,  but  for  other  types  of  investigations  as  well. 
Differentiation  between  glutamyl  and  aspartyl  residues  from  their  respec- 
tive cü-amides  is  simple  since  the  enzyme  acts  only  on  a-peptide  or  a-amide 
bonds.  Moreover,  when  an  optically  active  amino  acid  residue  is  liberated, 
it  must  be  of  the  l  configuration,  since  the  aminopeptidase  does  not  release 
A^-terminal  d  residues.  This  specificity  also  allows  the  enzyme  to  be  used 
for  resolution  of  DL-a-amino  acid  amides.  It  may  be  emphasized  that  we 
have  been  unable  to  find  any  measurable  transferase  or  synthetic  action  by 
the  enzyme,^"  findings  which  insure  unequivocal  results  in  structural  studies. 

The  sharp  specificity  of  the  aminopeptidase  permits  studies  of  the  rela- 
tionship between  the  structure  of  the  amino-terminal  end  of  a  susceptible 
polypeptide  or  protein  and  the  biological  activity  of  the  substance.  The 
present  section  is  devoted  to  a  study  of  the  action  of  the  aminopeptidase 
on  mercuripapain. 

Active  papain  cannot  be  employed  as  a  substrate  for  the  aminopeptidase, 
since  the  latter  is  rapidly  digested  and  inactivated  by  the  proteinase ;^^  how- 
ever, mercuripapain  is  essentially  inert.  It  has  already  been  reported  briefly  ^"^ -^^ 
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that  the  aminopeptidase  liberates  a  large  quantity  of  amino  acids  from 
mercuripapain,  the  extent  of  degradation  depending  on  the  ratio  of  enzyme 
to  substrate.  The  data  of  Table  5  indicate  that  as  much  as  two  thirds  of  the 
amino  acid  residues  of  mercuripapain  can  be  liberated.  What  is  most  strik- 
ing is  that  in  each  case  the  enzyme,  after  removal  of  mercury,  still  retains 
its  activity  towards  benzoyl-L-argininamide  (Table  5).  These  results  indicate 

Table  5 

DEGRADATION    OF    MERCURIPAPAIN    BY    LEUCINE 
AMINOPEPTIDASE 

Mg++-activated  aminopeptidase  {E)  was  allowed  to  act  on  mercuripapain  iS)  for  24 
hours  at  the  pH  indicated.  The  molar  ratio  of  5  to  £  is  indicated.  Extent  of  degradation 
was  estimated  by  the  increment  in  ninhydrin  color  expressed  as  leucine  equivalents. 
Activity  was  estimated  with  benzoyl-L-argininamide  as  substrate  as  previously  described.^ 
Activity  of  degraded  mercuripapain,  after  activation,  given  as  per  cent  of  control  before 
digestion.  From  Hill  and  Smith.^s 


SioE 

pH 

Moles  of  amino  acid  liberated 

Relative  activity 

150 

8  0 

18 

100 

140 

8-5 

27 

88 

110 

8  0 

23 

100 

110 

8-6 

36 

100 

40 

8-5 

50 

104 

10 

8-5 

90 

89 

5 

8-5 

120 

99 

that  the  active  site  of  papain  must  be  remote  from  the  TV-terminal  region 
of  the  molecule. 

In  documenting  the  above  observations,  it  is  obviously  important  to  de- 
monstrate that  only  amino  acids  are  liberated  and  that  these  are  derived 
entirely  from  the  A^-terminal  end  of  mercuripapain,  as  expected  from  the 
specificity  of  the  aminopeptidase  and  from  experience  with  other  polypep- 
tides and  proteins.  Secondly,  it  is  desirable  to  study  the  residual  mercuri- 
papain itself  to  determine  that  the  active  molecule  differs  from  the  original 
protein.  Thirdly,  we  have  attempted  to  ascertain  whether  the  specificity  of 
the  degraded  papain  is  altered.  It  may  be  recalled  that  Perlmann^^  found 
that  the  activity  of  pepsin  fragments,  obtained  by  autolysis,  differed  for 
protein  and  peptide  substrates  as  compared  with  the  intact  enzyme. 


Free  amino  groups.  As  already  noted  above,  the  DNP  amino  acids  found 
after  hydrolysis  of  intact  DNP-mercuripapain  are  one  equivalent  of  DNP- 
isoleucine  and  8  of  e-DNP-lysine.  Table  6  gives  the  findings  with  mercuri- 
papain hydrolyzed  by  the  aminopeptidase  with  different  numbers  of 
residues  released.  It  is  evident  that  the  mercuripapain  must  be  hydrolyzed  at 
Nps 
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its  jV-terminal  end,  both  because  yV-terminal  isoleucine  disappears  and  be- 
cause new  end  groups  are  found.  Inasmuch  as  the  hydrolysis  by  the  amino- 
peptidase  proceeds  in  a  stepwise  manner,  it  is  to  be  anticipated,  as  the  results 
in  Table  6  demonstrate,  that  a  more  and  more  complex  mixture  of  end 
groups  will  appear  as  more  breakdown  occurs.  For  this  reason,  study  of 
end  groups  formed  beyond  the  level  indicated  in  Table  6  has  not  been 
pursued. 

Table  6 
AMINO    END    GROUPS    OF    DEGRADED    MERCURIPAPAIN 


Preparation 

Extent 

of 

degradation 

Treatment  of 
DNP-protein 

DNP-amino  acids 
identified 

Moles  per 
mole  of 
protein 

I 

21 

6N-HC1;  110°;  18  hours 

DNP-Asp 

DNP-Ala 

DNP-Phe 

e-DNP-Lys 

* 
* 
* 
6-9 

Oxidized,  6n-HC1; 
110°,  18  hours 

DNP-CySOgH 
€-DNP-Lys 

0 

6-3 

12N-HC1;  16  hours; 
110° 

DNP-Gly 
DNP-Pro 

trace 
0 

II 

28 

6N-HC1;  24  hours; 
110° 

DNP-Asp 
DNP-Ala 
DNP-Phe 
DNP-Arg 

008 
0  04 
0-21 
0-50 

III 

36 

6N-HC1;  24  hours; 
110° 

DNP-Asp 
DNP-Val 
DNP-Ser 
DNP-Glu 
DNP-Ala 

0-38 
0-31 
018 
trace 
trace 

*  indicates  present,  but  not  quantitatively  estimated. 

Amino  acids  liberated.  At  various  stages  of  hydrolysis  of  mercuripapain  by 
the  aminopeptidase,  amino  acid  analyses  have  been  performed.  The  free 
amino  acids  were  collected  by  dialysis  and  analyzed  on  ion-exchange  columns 
by  the  older  method  of  Moore  and  Stein^^  or  by  a  modification  of  the  newer 
method  of  Spackman,  Moore  and  Stein. ^^  In  no  case  was  there  any  evidence 
for  the  presence  of  unexpected  peaks  which  might  be  due  to  liberated  pep- 
tides. Representative  results  at  two  stages  of  degradation  are  given  in 
Table  7.  Other  results  are  presented  in  Table  8  and  will  be  discussed  below. 
It  should  be  emphasized  that  at  no  point  does  the  relative  composition 
of  the  liberated  amino  acids  resemble  that  of  papain  or  of  the  aminopepti- 
dase. The  data  indicate  that  the  composition  of  the  iV-terminal  portion  of 
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Table  7 

AMINO    ACIDS    LIBERATED    FROM    MERCURIPAPAIN    BY 
AMINOPEPTIDASE 

Preparation  I = approximately  19  residues  released;  Preparation  11= approximately  72 
residues  released 


Moles  per  mole  of  mercuripapain 

Aminn  aoiH 

Whole  papain 

i^JllUlV/   civiu 

Preparation  I 

Preparation  II 

Cysteic  acid 

+ 

10 

6 

Aspartic  acid 

10 

4-1 

17 

Threonine 

0-9 

2-8 

7 

Serine  and  asparagine 

2  0 

7-1 

11 

Glutamic  acid 

10 

9-2 

17 

Glycine 

21 

6-7 

23 

Alanine 

20 

9-2 

14 

Valine 

1-8 

4-5 

15 

Isoleucine 

10 

6-4 

9 

Leucine 

1-3 

60 

9 

Tyrosine 

1-3 

2-7 

17 

Phenylalanine 

0-7 

1-2 

4 

Arginine 

10 

4-4 

9 

Lysine 

10 

6-6 

8 

Proline 

+ 

+ 

9 

Tryptophan 

00 

[10] 

5 

tlie  papain  molecule  is  different  from  that  of  the  C-terminal  region.  It  is 
striking  that  much  of  the  threonine,  alanine,  leucine  and  isoleucine  are  in 
the  first  72  residues,  whereas  most  of  the  tyrosine  and  valine  residues  appear 
to  be  in  the  remainder  of  the  molecule.  It  is  particularly  noteworthy  that  only 
one  of  the  six  cysteine  residues  has  been  located  in  the  A^-terminal  region. 
Obviously  this  cannot  be  the  enzymically  active  residue. 

It  is  noteworthy  that  when  19  residues  are  released,  there  appears  to  be 
excellent  stoichiometry,  indicating  a  rather  uniform  hydrolysis  of  all  mole- 
cules. In  contrast  to  this,  when  approximately  72  residues  are  released,  the 
breakdown  is  no  longer  uniform  in  accord  with  results  obtained  by  the 
DNP  method  on  residual  mercuripapain. 

Although  we  have  performed  analyses  of  the  liberated  amino  acids  be- 
yond the  stage  given  in  Table  7,  we  are  uncertain  as  to  the  significance  of 
these  results  at  the  present  time.  There  is  some  suspicion  that  slight  activa- 
tion of  mercuripapain  may  occur  in  the  presence  of  the  large  amounts  of 
aminopeptidase  required  for  such  extensive  degradation.  Moreover,  the 
large  amounts  of  free  amino  acids  may  cause  also  activation  of  the  pro- 
teinase. Until  such  results  are  adequately  controlled  we  prefer  to  present 
only  the  results  of  experiments  in  which  it  is  certain  that  such  complicating 
factors  do  not  occur. 
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One  of  the  best  methods  of  checking  the  results  of  degradation  experi- 
ments is  to  be  able  to  add  up,  for  a  given  experiment,  the  amino  acids 
released  and  the  amino  acids  found  in  the  undegraded  residual  mercuri- 
papain.  Thus  far,  only  two  such  experiments  have  been  performed.  In 
Experiment  I,  the  liberated  amino  acids  v^ere  collected  by  dialysis  and 

Table  8 

ANALYSIS    OF    AMINO    ACIDS    LIBERATED    AND    THOSE 

REMAINING    IN    MERCURIPAPAIN    AFTER    HYDROLYSIS    WITH 

AMINOPEPTIDASE    (LAP) 

In  Experiment  I,  the  undegraded  residual  mercuripapain  was  collected  by  isoelectric 
precipitation.  In  Experiment  II,  the  degraded  material  was  isolated  by  the  chromato- 
graphic procedure  shown  in  Fig.  5. 


Amino  acid 

Experiment  I 

Experiment  II 

Sum 

Sum 

Com- 
position 

of 
Papain 

Left 

Left 

Liberated 
by  LAP 

in  frag- 
ment 

Liberated 
by  LAP 

in  frag- 
ment 

I 

II 

Cysteic  acid 

11 

4-8 

0-9 

5  0 

5-9 

5-9 

6 

Aspartic  acid 

2-7 

13-6 

5-7 

12-1 

16-3 

17-8 

17 

Threonine 

1-7 

4-7 

2-5 

4-3 

6-4 

6-8 

7 

Serine 

2-9 

8-2 

2-4 

7-8 

HI 

10-2 

11 

Glutamic  acid 

2-4 

12-6 

6  0 

11-2 

150 

17-2 

17 

Proline 

— 

— 

(2-9)* 

61 

— 

(9-0) 

9 

Glycine 

4-4 

15-6 

5-1 

18-3 

20  0 

23-4 

23 

Alanine 

4-3 

9-2 

51 

8-3 

13-5 

13-4 

13 

Valine 

3-1 

12-7 

2-8 

9-2 

15-8 

12-0 

15 

Isoleucine 

1-9 

7-0 

2-4 

4-9 

8-9 

7-3 

9 

Leucine 

2-5 

7-9 

3-5 

5-8 

10-4 

9-3 

9 

Tyrosine 

1-9 

10-3 

— 

— 

12-2 

— 

17 

Phenylalanine 

1-0 

3-2 

11 

2-6 

4-2 

3-7 

4 

Arginine 

20 

7-8 

2-0 

6-0 

9-8 

8-2 

9 

Lysine 

1-3 

5-6 

4-2 

5-6 

6-9 

9-8 

8 

Histidine 

00 

10 

00 

0-8 

10 

0-8 

1 

Tryptophan 

0-7 

— 

— 

— 

— 

— 

5 

Total  Residues 

34 

124 

47 

108 

158 

155 

180 

Corrected  Total 

172t 

178t 

*  Assumed  value. 

t  For  comparison  with  intact  papain,  the  total  number  of  tyrosine,  tryptophan  and 
proline  residues  has  been  assumed  to  be  the  same  as  in  the  original  papain. 


analyzed  as  described  above.  A  separate  aliquot  was  oxidized  with  per- 
formic  acid  for  determination  of  cysteic  acid.  The  residual  mercuripapain 
was  then  precipitated  isoelectrically,  oxidized  with  performic  acid,  hydro- 
lyzed  with  6  N-HCl  in  the  usual  manner,  and  the  hydrolysate  was  analyzed. 
The  results  are  given  in  Table  8. 


11]  STRUCTURE   AND   ACTIVITY  OF   PAPAIN  197 

More  recently,  a  method  has  been  developed  for  separating  chromato- 
graphically,  inactivated  aminopeptidase  from  mercuripapain  or  from  de- 
graded mercuripapain.^^  The  chromatography  is  shown  in  Fig.  5.  The  main 
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TUBE  NUMBER 

Fig.  5.  Chromatographic  separation  of  the  proteins  in  an  inactivated  preparation  of 

leucine  aminopeptidase  from  mercuripapain  (upper  figure)  or  degraded  mercuripapain 

(minus  approximately  50  residues)  (lower  figure).  A  1-3  x  40  cm.  column  of  IRC-50  was 

used  at  pH  6-3,  initially  with  0-1  m  phosphate  buffer  and  then  with  0-2  m  phosphate.  (Hill 

and  Smith,  unpublished.) 


feature  of  the  chromatography  on  IRC-50  is  that  inactivated  aminopepti- 
dase and  contaminating  proteins  of  the  aminopeptidase  preparation  are 
readily  eluted  with  0-1  m  phosphate  buffer  at  pH  6-3,  whereas  the  mercuri- 
papain or  degraded  material  is  quantitatively  absorbed.  Phosphate  buffer 
at  0-2  M  is  then  used  to  displace  the  mercuripapain.  Although  neither  the 
inert  mercuripapain  nor  the  degraded  material  shows  symmetrical  peaks, 
it  is  evident  that  there  is  excellent  agreement  between  the  curves  for  relative 
activity  and  for  ninhydrin  color.  This,  in  effect,  demonstrates  the  absence 
of  significant  amount  of  inert  material. 

The  contents  of  the  tubes  containing  the  degraded  material  were  pooled 
and  worked  up  by  conventional  methods.  The  mixture  was  then  hydrolyzed 
with  6N-HC1  and  oxidized  with  performic  acid.  Unfortunately,  this  pro- 
cedure destroys  the  tryptophan  and  much  of  the  tyrosine,  but  values  were 
obtained  for  the  other  amino  acids.  Values  for  tyrosine  and  tryptophan  will 
have  to  be  estimated  in  other  experiments. 

In  order  to  compare  the  results  of  Experiment  II  on  the  hydrolysate  of 
residual  papain  with  the  analyses  for  the  liberated  amino  acids,  it  has  been 
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assumed,  as  found  earlier  (Table  7),  that  only  one  residue  of  cysteic  acid  is 
found  among  the  oxidized  free  amino  acids  and  five  cysteic  acid  residues 
remain  (after  oxidation)  in  the  residual  material.  The  results  of  both  experi- 
ments at  two  levels  of  degradation,  which  are  given  in  Table  8,  indicate 
that,  within  experimental  error,  there  is  excellent  agreement  between  the 
sum  of  the  residues  liberated  and  those  recovered  in  the  residual  protein, 
and  with  the  previously  reported^^  analysis  of  papain.  It  should  be  noted 
that  the  absence  of  methionine  or  its  sulfone  from  oxidized  samples,  and  the 
expected  low  recovery  of  histidine  and  phenylalanine  are  consonant  with 
the  view  that  all  the  materials  analyzed  are  derived  from  the  mercuripapain 
and  not  from  the  aminopeptidase. 

Specificity.  One  of  the  major  reasons  why  the  intensive  study  of  papain 
was  undertaken  is  that  a  variety  of  synthetic  substrates  of  known  structure 
can  be  studied. ^  It  would  appear  from  general  experience  that  the  specificity 
of  enzyme-substrate  interaction  can  be  more  readily  studied  under  these 
circumstances  than  when  the  enzyme  is  limited  in  its  action  to  one  or  a  few 
compounds. 

It  was  of  great  interest  to  study  the  relative  specificity  of  papain  at  various 
stages  of  degradation.^2-^^  The  results  with  several  synthetic  substrates  of 
widely  differing  structure  are  given  in  Table  9  and  the  data  obtained  with 

Table  9 

SPECIFICITY   OF    DEGRADED    PAPAIN 

A  sample  of  Mg++-activated  aminopeptidase  (21  mg.,  Ci=60)  was  incubated  at  40°  and 
pH  8-6  with  14-3  mg.  of  mercuripapain.  Aliquots  were  removed  at  various  intervals  and 
the  extent  of  degradation  was  estimated  by  the  ninhydrin  method.  Other  aliquots  were 
tested  with  the  substrates  shown.  Each  test  mixture  contained  0-05  m  substrate,  0-005  M 
cysteine,  0-001  m  ethylenediaminetetraacetate  and  0-05  m  acetate  buffer  at  pH  5-2.  Hydro- 
lysis was  estimated  by  titration  with  0-01  n-KOH  in  90  per  cent  ethanol.  The  Cx  was 
calculated  on  the  basis  of  initial  papain  concentration.  Optically  active  substrates  were 
of  the  L  configuration. 


Residues 
released 
per  mole 
of  papain 

Ci 

Bz.Arg.NH2 

Cbz.Glu(NH2)2 

Cbz.Leu.NH2 

Cbz.His.NH2 

Bz.Gly.NH2 

0 

29 

55 
70 

2-0 
21 
1-8 
2-2 

0-59 
0-56 
0-56 
0-52 

0-048 
0-048 
0-047 
0-048 

0-033 
0-030 
0-024 
0040 

0-044 
0-042 
0-035 
0-042 

human  serum  albumin  as  substrate  are  shown  in  Fig.  6.  These  findings 
demonstrate  that,  within  experimental  error,  no  change  in  substrate  speci- 
ficity occurs  when  as  many  as  70  of  the  180  residues  of  papain  are  removed 
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from  the  enzyme.  In  essence,  these  results  support  the  view,  expressed  ear- 
]jgj.^36-38  j-j^^j-  substrate  specificity  and  catalytic  efiectiveness  are  part  of  the 
same  phenomenon,  namely,  that  each  locus  of  interaction  on  the  enzyme, 
which  contributes  to  the  specificity,  plays  a  role  in  the  catalytic  transforma- 
tion of  the  substrate.  Perhaps  this  is  just  another  way  of  stating  that  the 
key  to  enzymic  action  resides  in  the  formation  of  the  enzyme-substrate 
complex,  a  problem  which  will  be  also  discussed  below. 
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Fig.  6.  Action  of  papain  and  degraded  papain  (minus  approximately  70  residues)  on 
human  serum  albumin  tested  at  40°  and  in  0  005  m  dimercaptopropanol  at  pH  5-2  in 
acetate  buffer.  (Hill  and  Smith,  unpublished.) 


AUTODIGESTION    OF    PAPAIN 

Recent  studies^^  have  shown  that  crystalline  mercuripapain  or  papain  can 
be  readily  chromatographed  on  IRC-50  in  neutral  or  slightly  acidic  0-2  m 
phosphate  buffers.  At  pH  7-0  the  protein  appears  to  be  homogeneous, 
whereas  at  pH  6-2  small  amounts  of  material  are  found  which  appear  in 
advance  of  the  main  active  component.  On  occasion,  some  of  the  faster 
material  showed  some  proteolytic  activity,  particularly  when  stored  papain 
was  used  for  the  chromatography.  Inasmuch  as  we  had  known  that  stored 
papain  undergoes  some  autodigestion,  a  solution  of  active  papain  was 
allowed  to  autolyze  for  a  period.  When  this  solution  was  chromatographed, 
as  shown  in  Fig.  7,  it  was  found  that  the  original  active  peak  had  diminished 
in  quantity  and  other,  more  rapidly  eluted,  fractions  appeared.  It  is  striking, 
however,  that  a  considerable  portion  of  the  proteolytic  activity  emerged 
with  the  papain  fragments  well  in  advance  of  the  peak  of  intact  papain. 

The  yield  of  active  fragments  is  small,  as  might  be  anticipated,  since 
active  papain  can  digest  other  molecules  at  a  variety  of  sites.  The  afore- 
mentioned results  with  the  aminopeptidase  digestion  indicate  that  the  active 
region  is  not  at  the  A^-terminal  portion  of  the  papain  molecule.  Hence  it 
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Fig.  7.  Chromatography  of  a  partially  autodigested  preparation  of  papain  in  0-2  m 
phosphate  buffer  at  pH  6-1  on  a  1-3x30  cm.  column  of  IRC-50.  Activity  (in  arbitrary 
units)  of  individual  fractions  towards  benzoyl-L-argininamide  is  indicated.  (Finkle  and 
Smith,  unpublished.) 

may  be  assumed  that  autolytic  cleavage  at  the  A^-terminal  region  will  pro- 
duce active  fragments  of  the  papain  molecule,  whereas  a  primary  cleavage 
at  or  near  the  C-terminal  region  will  result  in  the  formation  of  inactive 
fragments.  Present  evidence  suggests  that  most  of  the  fragments  are  en- 
zymically  inactive. 

The  results  obtained  both  by  digestion  with  the  aminopeptidase  and  by 
autodigestion  indicate  that  only  a  portion  of  the  papain  molecule  is  con- 
cerned both  in  the  catalytic  activity  of  the  enzyme  and  in  determining  speci- 
ficity. The  findings  suggest  that  it  will  be  unnecessary  to  determine  the 
complete  structure  of  the  protein  in  order  to  understand  the  properties  and 
mechanism  of  action  of  the  enzyme. 

KINETIC    STUDIES 

Evidence  concerning  the  nature  of  the  active  region  of  an  enzyme  can  fre- 
quently be  derived  from  a  study  of  the  kinetic  behavior  of  an  enzyme- 
substrate  system.  For  this  reason,  detailed  kinetic  studies  with  papain  and 
a  variety  of  synthetic  substrates  are  being  performed.  Only  one  phase  of 
these  investigations  will  be  discussed  here  since  they  have  yielded  some 
information  on  the  enzyme  groupings  which  participate  in  the  enzyme- 
substrate  interaction. 
The  usual  formulation  of  the  kinetics  of  an  enzyme  reaction  is 

ki  ko 

E+S  ^=^  ES  >  P+E 

k-i 
where  E  is  enzyme,  S  substrate,  P  products  and  ES  the  enzyme-substrate 
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complex.  Km  and  k^  can  be  determined  by  the  conventional  methods 
of  estimating  the  initial  velocity  of  hydrolysis  over  a  range  of  substrate 
concentrations. 

It  should  be  recalled  that  the  complex  constant  Km  =  (^o+^~i)/^i-  It 
has  been  shown  for  some  enzymes  that  k^k  —  ^.  In  these  cases,  Kwi  = 
kajki.  Our  studies^  have  provided  considerable  evidence  that  this  is  the 
situation  for  papain.  Therefore,  by  determining  K,n  and  ätq,  we  can  calcu- 
late the  value  of  k^,  the  velocity  constant  for  the  formation  of  the  enzyme- 
substrate  complex.  The  constants  have  been  determined  over  a  broad  pH 
range  at  three  temperatures  for  the  hydrolysis  of  benzoyl-L-argininamide.^ 
Let  us  consider  only  the  calculated  values  of  k^  which  are  shown  in  Fig.  8 
as  relative  values  arbitrarily  superimposed  at  their  maxima.  It  is  evident 
that  all  of  the  data  have  the  same  general  form  at  the  acid  pH  values  but 
the  descending  curves  on  the  alkaline  side  have  been  displaced  at  the  different 
temperatures. 


Fig.  8. 
ki  for  the 
(9).) 


EflFect  of  pH  on  ky  for  the  hydrolysis  of  benzoyl-L-argininamide.  The  average 
maximum  was  set  equal  to  100  for  each  temperature.  (From  Stockell  and  Smith 


These  data  may  be  readily  explained  on  the  basis  of  the  assumption  that 
the  descending  portions  of  the  curves  represent  the  titration  of  groups  on 
the  enzyme  which  are  essential  for  the  k^  step.  The  pH  corresponding  to 
kx  at  half  of  the  maximal  value  is  near  pH  3-9,  which  would  represent  the 
pK'  of  the  group  being  titrated.  This  pK'  value  is  consistent  with  that  of 
a  carboxyl  group.  The  absence  of  any  appreciable  effect  of  temperature  on 
this  pK'  value  is  consistent  with  the  knowledge  that  the  heat  of  ionization 
of  a  carboxyl  group  is  small. ^° 

The  alkaline  portions  of  the  k-i  may  be  interpreted  in  a  similar  manner 
but  it  is  apparent  that  the  pK'  of  the  group  being  titrated  varies  markedly 
with  temperature.  The  apparent  heat  of  ionization  is  approximately  9-5 
kilocal.  per  mole  at  0°.  Titratable  groups  present  in  proteins  which,  at  first 
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sight,  might  be  responsible  are  an  a-  or  e-ammonium  groups,  an  imida- 
zoHum,  a  phenolic  or  a  sulfhydryl  group.  It  is  very  unlikely  that  the  strongly 
cationic  substrate,  benzoylargininamide,  could  interact  with  a  cationic  site 
on  the  enzyme.  Moreover,  the  calculated  pK'  values  are  much  too  low  for 
an  e-ammonium  or  a  phenolic  group,  and  too  high  for  an  imidazolium 
group.  Papain  possesses  only  one  a-amino  group,  but,  as  already  em- 
phasized, much  of  the  amino-terminal  sequence  is  unessential  for  the  activity 
of  the  enzyme.  These  considerations  suggest  that  the  a-amino  group  does 
not  participate  in  determining  the  value  of  ki. 

In  effect,  the  only  remaining  group  to  be  considered  is  the  sulfhydryl 
group  and  it  is  already  known  that  such  a  group  is  essential  for  papain 
activity.  It  may  be  noted  that  the  pK'  values  are  in  the  range  recently  found 
for  sulfhydryl  groups, ^^  but  the  calculated  heat  of  ionization  appears  to  be 
rather  high.  However,  these  data  are  rather  limited  in  the  critical  regions 
where  pH  has  a  marked  effect  on  ki,  and  the  precision  is  rather  low.  In 
subsequent  studies,  the  effect  of  pH  on  the  hydrolysis  of  other  substrates 
has  been  determined  and,  now  knowing  the  important  range  to  be  investi- 
gated, more  complete  data  have  been  obtained. 
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Fig  9.  Effect  of  pH  on  kx  for  the  hydrolysis  of  carbobenzoxy-L-histidinamide  with  the 
maximum  for  each  temperature  set  equal  to  100.  (From  Smith,  Chavre  and  Parker  (42).) 


Fig.  9  shows  the  effect  of  pH  on  calculated  values  of  k^  for  the  hydrolysis 
of  carbobenzoxy-L-histidinamide.*^  The  data  have  been  fitted  by  drawing 
theoretical  titration  curves  for  a  single  titratable  group  at  both  sides  of  the 
curve.  On  the  acid  side  of  the  curve,  there  is  little  effect  of  temperature  and 
the  apparent  pK'  at  38°  is  near  4-3.  On  the  alkaline  side,  there  is  a  shift  of 
pK'  with  temperature  and  the  calculated  AW,  the  heat  of  ionization,  is  about 
4-4  kcal,  per  mole  at  0°.  Both  the  pK'  value  and  the  value  of  /I H  are  near 
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those  expected  for  a  sulfhydryl  group. '*^  Studies  at  38°  of  the  effect  of  pH 
on  ki  for  the  hydrolysis  of  benzoylglycinamide*^  and  of  benzoyl-L-arginine 
ethyl  ester^''  yield  curves  with  the  same  shape  as  those  shown  for  carbo- 
benzoxy-L-histidinamide. 

The  effect  of  pH  on  ki  for  another  type  of  substrate,  carbobenzoxygly- 
cylglycine,  is  shown  in  Fig.  10.^"^  On  the  alkaline  side,  the  curve  has  been 


Fig.  10.  Effect  of  pH  on  ki  for  hydrolysis  of  carbobenzoxyglycylglycine  at  38°.  (From 
Smith,  Chavre  and  Parker  (42).) 

fitted  by  a  theoretical  titration  curve  with  a  pK'  near  8-0  which  is  close  to 
that  found  for  other  substrates  at  38°.  However,  there  is  a  plateau  near  pH  6 
and  ki  rises  sharply  at  more  acid  pH  values.  It  should  be  emphasized  that 
at  pH  6,  the  absolute  value  for  ki  for  the  scission  of  carbobenzoxyglycyl- 
glycine is  only  about  0-1  per  cent  of  that  for  benzoylargininamide.  At  pH  6, 
both  the  enzyme  and  carbobenzoxyglycylglycine  would  have  carboxyl  groups 
in  ionized  form  and  electrostatic  repulsion  may  be  expected.  The  increasing 
values  of  ki  below  pH  5-5  would  appear  to  be  due  to  repression  of  ioniza- 
tion of  one  or  both  carboxyl  groups,  thus  decreasing  the  degree  of  repul- 
sion between  enzyme  and  substrate  and,  in  effect,  promoting  an  increase 
in  the  value  of  ki. 

The  kinetic  data  which  have  been  cited,  point  to  the  participation  of  the 
essential  sulfhydryl  group  in  the  ki  step.  Moreover,  since  the  optimal  region 
of  ki  is  near  pH  6,  it  is  evident  that  the  active  form  of  this  group  is  RSH 
and  not  RS~.  In  addition,  evidence  has  been  obtained  that  an  ionized  carb- 
oxyl group  is  involved  in  the  ki  step.  Inasmuch  as  the  kinetic  studies  have 
been  performed  with  rather  small  substrates,  it  is  likely  that  the  carboxyl 
group  must  be  close  to  the  sulfhydryl  group. 

A   TENTATIVE    REACTION    MECHANISM 

Thus,  the  evidence  from  kinetic  and  other  studies  indicates  that  the  essential 
sulfhydryl  group  participates  in  the  enzyme-substrate  interaction  and  points 
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to  an  important  role  of  an  ionized  carboxyl  group  in  this  reaction.  A  tenta- 
tive reaction  mechanism  for  the  action  of  papain  has  been  suggested.^** 
The  main  feature  of  this  scheme  is  that  a  thiol  ester  is  formed  in  the  enzyme 
substrate  complex. 

Briefly,  the  following  types  of  observations  on  papain  must  be  considered. 
1.  The  most  sensitive  known  substrates  are  those  which  contain  the  carb- 
oxyl group  of  an  arginyl  or  lysyl  residue  linked  to  the  sensitive  bond;  i.e., 
the  compounds  have  a  strongly  cationic  group  on  the  side  chain.  2.  Papain 
can  hydrolyze  synthetic  substrates  which  possess  ester^  or  thiol  ester**  bonds, 
as  well  as  those  with  peptide  or  amide  bonds.  3.  In  addition  to  hydrolytic 
reactions,  papain  catalyzes  synthetic  and  transfer  reactions. ^-^^  4.  There  is 
one  sulfhydryl  group  which  is  absolutely  essential  for  the  action  of  papain.^ 

Formation  of  a  thiol  ester  is  consistent  with  the  spatial  and  chemical 
specificity  of  papain  which  is  mainly  toward  the  amino  acid  residue  bearing 
the  carbonyl  group  of  the  sensitive  peptide  bond.  The  moiety  which  is  dis- 
placed from  or  transferred  to  the  carbon  of  this  carbonyl  group  may  be 
any  one  of  a  variety  of  compounds,  e.g.,  a  peptide,  an  amino  acid,  ammonia, 
aniline,  hydrazine,  hydroxylamine,  an  alcohol,  a  thiol,  etc.^ 

Postulation  of  a  thiol  ester  intermediate  would  appear  to  be  necessary  to 
explain  the  efiiciency  of  papain  in  catalysing  transfer  reactions,  since  the 
replacement  reagent  R-  NHg  competes  with  water  or  the  ions  of  water  for 
the  hydrolytic  reaction.  The  expected  free  energy  change  for  the  scission  of 
the  thiol  ester  bond  would  indicate  that  the  equihbrium  would  greatly  favor 
hydrolysis.  It  may  also  be  emphasized,  that  as  in  the  case  of  ficin,***  we 
have  found  that  the  amide  and  the  ethyl  ester  of  benzoyl-L-arginine  are 
split  at  about  the  same  rate.*^  In  fact,  the  kQ  values  are  almost  identical. 
This  suggests  that  the  cleavage  step,  in  the  overall  reaction,  involves  a 
common  intermediate,  such  as  the  postulated  thiol  ester.  In  view  of  the  usual 
large  difference  in  free  energy  of  activation  for  hydrolysis  of  similar  esters 
and  amides,  the  similar  Uq  values  are  hard  to  explain  except  in  terms  of  an 
identical  intermediate. 

Before  concluding,  it  may  be  emphasized  that  there  is  one  major  objec- 
tion to  the  view  that  a  thiol  ester  is  formed  with  a  free  thiol  group  of  the 
enzyme.  It  is  very  likely  that  the  overall  free  energy  of  formation  of  a  thiol 
ester  is  much  greater  than  that  of  the  susceptible  peptide  or  ester  bond. 
This,  in  effect,  would  make  the  overall  reaction  scheme  energetically  im- 
possible, unless  the  rate  of  scission  of  the  thiol  ester  is  sufficient  to  drive 
the  reaction  forward. 

A  possible  explanation  is  that  a  free  thiol  group  does  not  exist  as  such 
in  active  papain.  Instead,  there  may  be  present  either  a  thiol  group  bonded 
as  a  high  energy  hydrogen  bond  to  the  titrable  carboxylate  ion  or  that  a 
thiol  ester,  formed  by  coupling  of  a  carboxyl  group  and  a  thiol  group,  exists 
preformed  in  the  papain  itself.  Let  us  consider  the  latter  possibility  first. 
This  hypothesis  is  attractive  insofar  as  the  initial  reaction  between  enzyme 
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and  substrate  would  then  involve  a  transfer  reaction  with  no  net  energy 
change  to  yield  an  acyl  amino  acid  thiol  ester,  and  this  would  be  followed 
by  a  displacement  reaction  reforming  the  thiol  ester  of  the  enzyme  and 
liberating  the  acyl  amino  acid  derived  from  the  substrate.  Thus,  the  major 
part  of  the  enzymic  reaction  would  consist  of  two  transfer  reactions  involv- 
ing essentially  equivalent  thiol  ester  exchanges  resulting  in  the  hydrolysis 
of  the  substrate. 

This  possibihty  is  also  interesting  since  it  would  explain  the  catalytic 
efficiency  of  the  enzyme  more  readily  than  does  the  view  that  an  ordinary 
thiol  group  is  involved.  However,  we  do  not  know  enough,  as  yet,  to  decide 
whether  this  is  more  than  an  interesting  possibility.  Some  of  the  evidence 
at  hand  does  not  seem  to  fit  this  hypothesis  too  well,  but  other  evidence 
would  fit  very  nicely  indeed.  Just  two  points  may  be  mentioned.  As  noted 
above,  the  activity  of  papain  is  related  to  the  amount  of  sulfhydryl  which 
reacts  rapidly  in  water  with  /7-chloromercuribenzoate  (Fig.  1).  The  rapidly 
reacting  SH  may  be  in  a  high  energy  form  which  is  kinetically  more  reactive 
than  ordinary  sulfhydryl. 

The  other  finding,  which  might  be  cited,  is  that  for  four  substrates,  whose 
kinetic  constants  have  been  measured,  the  value  of  ätq  is  related  to  the  value 
of  k^.  This  is  shown  in  the  double  logarithmic  plot  of  Fig.  1 1.^^  If  we  attach 


LOG  k 


I 

Fig.  11.  A  hypothetical,  tentative  scheme  for  the  hydrolytic  action  of  papain.  (From 
Stocken  and  Smith  (9).) 

the  usual  meaning  to  these  constants,  the  relationship  suggests  that  the 
more  readily  ES  forms,  the  more  readily  it  breaks  down.  Such  a  view  would 
be  in  accord  with  the  idea  that  both  reactions,  i.e.,  those  involving  k^  and  âtq, 
involve  chemically  similar  steps,  namely  transfer  reactions  of  thiol  esters.  It 
may  be  remarked,  in  passing,  that  the  relationship  illustrated  in  Fig.  1 1  has 
also  been  found  for  a  number  of  related  substrates  for  carboxypeptidase.*' 
The  alternative  hypothesis  that  the  enzyme  contains  RCOO~  linked  to 
R'SH  to  form  a  hydrogen-bonded  structure,  RCOO" — HSR',  would  readily 
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explain  the  effect  of  pH  on  ki,  since  addition  or  removal  of  a  proton  would 
open  the  hydrogen  bond.  Similarly,  if  there  is  a  preformed  thiol  ester,  it 
may  be  postulated  that  the  linkage  is  opened  by  changes  in  pH,  but  it  would 
be  necessary  to  assume  that  refolding  of  the  protein  could  permit  regenera- 
tion of  the  thiol  ester. 

Either  hypothesis  is  attractive  since  the  presence  of  a  'high-energy'  bond 
would  aid  in  explaining  the  catalytic  activity  of  the  enzyme.  Indeed,  it  is 
possible  that  such  'high-energy'  bonds  are  an  essential  part  of  the  active 
sites  of  enzymes  and  that  their  formation  depends  on  the  specific  folding 
of  the  protein.  It  may  be  noted  that  the  activity  of  papain  is  reversibly  lost 
on  denaturation  with  urea.  Thus,  papain,  like  other  enzymes,  depends  for 
its  activity  on  a  specific  three-dimensional  structure  despite  the  fact  that  a 
large  part  of  its  amino  acid  sequence  is  unessential  for  the  maintenance  of 
its  activity. 

CONCLUDING    REMARKS 

It  is  evident  from  the  above  summary  of  some  of  our  studies  with  crystalline 
papain  that  only  a  beginning  has  been  made  in  understanding  how  the 
structure  of  this  protein  is  related  to  its  function.  It  will  take  a  long  time 
before  the  mechanism  of  action  and  the  specificity  of  this  enzyme  can  be 
related  to  its  structure.  Nevertheless,  it  is  satisfying  that,  in  our  approach 
to  the  biological  activity  of  the  proteins,  we  now  have  the  general  know- 
ledge that  considerable  modification  of  structure  is  possible  without  altera- 
tion of  the  activity  or  specificity  of  antibodies,  protein  hormones,  enzymes, 
and  viruses.  Above  all  else,  the  observations  made  on  these  substances 
indicate  the  general  validity  of  the  concept  of  the  active  site,  namely,  that 
only  a  part  of  the  structure  of  a  protein  molecule  is  responsible  for  and 
involved  in  its  specific  interactions.  Future  progress  in  the  isolation  and 
identification  of  such  active  sites  or  regions  of  proteins  will  depend  as  much 
on  our  choice  of  proteins  for  study,  as  on  the  available  tools  and  methods. 
The  proteolytic  enzymes  will  continue  to  play  a  dual  role  in  such  studies, 
since,  as  this  symposium  illustrates,  everyone  must  use  these  enzymes  in 
his  working  methods  with  other  proteins.  At  the  same  time,  they  are  among 
the  more  favorable  materials  for  investigation. 
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Rather  than  attempt  a  review  of  published  information  that  deals  with  the 
structure  of  ribonuclease  (cf.  1,  for  example),  we  will  in  the  present  dis- 
cussion emphasize  the  more  recent  work  and  the  methods  that  are  cur- 
rently being  employed  to  bring  the  structural  studies  to  a  conclusion.  Some 
recapitulation  is  desirable,  however,  and  it  can  be  furnished  by  reference  to 
the  partial  structural  formula  for  oxidized  ribonuclease  shown  in  Fig.  1, 
which  summarizes  the  status  of  our  knowledge  as  of  early  1956.^  The  single 
chain  of  124  residues  in  the  oxidized  protein  is  portrayed  in  four  segments, 
which  should  be  imagined  as  joined  end  to  end.  The  bonds  at  which  the 
chain  is  attacked  by  proteolytic  enzymes  are  indicated  by  the  vertical  lines  : 
solid  lines  for  trypsin,  dashed  lines  for  chymotrypsin,  and  dotted  lines  for 
pepsin.  The  residues  included  within  the  parentheses  are  of  undetermined 
sequences. 

The  derivation  of  the  formula  shown  in  Fig.  1  owes  very  much  to  the 
stimulating  and  pioneering  investigations  of  Sanger  and  his  colleagues  on 
the  structure  of  insulin  (cf.  3  for  references).  Several  of  the  steps  in  the 
derivation  were  patterned  directly  after  his  work  ;  others  had  to  be  changed 
to  a  greater  or  lesser  extent  depending  upon  the  nature  of  the  special 
problems  encountered.  A  sample  of  ribonuclease  containing  93  per  cent  of 
the  A  component  was  employed  for  most  of  the  structural  study.  The  fact  that 
ribonuclease  contains  but  a  single  peptide  chain,  and  that  this  chain  bears  the 
sequence  Lys.Glu.Thr.Ala-  at  the  amino-end,  was  estabhshed  by  Anfinsen 
and  his  colleagues*  by  the  use  of  the  DNP  technique.  The  first  step  in  our 
own  studies  was  an  amino  acid  analysis  that  showed  that  ribonuclease  A 
contained  124  to  126  amino  acid  residues.^  This  was  followed  by  oxidation 
of  the  single  chain  at  low  temperature  with  performic  acid^  under  condi- 
tions chosen  so  as  to  avoid  the  formation  of  chlorotyrosine. 

The  four  disulfide  bonds  in  the  molecule  were  thus  oxidized  to  eight 
sulfonic  acid  groups,  and  the  four  methionine  residues  to  the  sulfone.  The 
other  amino  acids  were  not  affected.  The  oxidized  protein  was  subjected  to 
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tryptic  hydrolysis  at  pH  7  and  25°,  the  peptides  produced  in  the  hydrolysis 
were  separated  on  a  preparative  scale  on  columns  of  Dowex  50-X2,  and 
their  amino  acid  compositions  were  determined  quantitatively  on  columns 
of  Dowex  50-X4.'  The  thirteen  principal  peptides,  formed  in  yields  of  be- 
tween 50  and  100  per  cent,  accounted  for  all  of  the  124  residues  in  the 
peptide  chain.  To  determine  the  order  in  which  these  peptides  were  joined 
to  one  another,  the  products  of  chymotryptic  and  peptic  hydrolyses  were 
similarly  isolated  and  their  amino  acid  composition  determined.  ^-^  From 
the  data  so  obtained,  it  was  possible  to  deduce  the  arrangement  shown  in 
Fig.  1  provided  three  assumptions  were  made.  These  were,  first,  that  there 
is  no  branching  of  the  peptide  chain,  and  that  all  of  the  residues  are  con- 
nected to  one  another  by  peptide  bonds  between  a-amino  and  a-carboxyl 
groups  ;  second,  that  in  peptides  formed  by  the  action  of  trypsin,  the  basic 
amino  acids,  arginine  or  lysine,  if  present,  occupy  the  carboxyl-terminal 
position;  and  third,  that  in  peptides  formed  by  the  action  of  chymotrypsin, 
a  tyrosine  or  phenylalanine  residue,  if  present,  occupies  the  carboxyl-terminal 
position. 

The  partial  structural  formula  appears  to  represent  a  unique  synthesis 
of  the  results,  in  that,  within  the  restrictions  imposed  by  the  assumptions 
made  in  its  derivation,  no  other  arrangement  of  the  peptides  has  been  found 
possible.  Moreover,  this  formula  accommodates  all  of  the  data  secured  thus 
far.  Every  peptide  that  has  been  characterized  appears  in  the  scheme  of 
proteolytic  cleavages  shown  in  Fig.  1 . 

This  partial  structural  formula  already  reveals  much  about  the  structure 
of  ribonuclease,  but  its  main  use  has  been  to  serve  as  a  point  of  departure 
for  three  relatively  independent  lines  of  research.  These  are  :  one,  the  deduc- 
tion of  the  sequence  of  the  amino  acid  residues  along  the  chain  ;  two,  the 
determination  of  the  location  of  the  four  disulfide  links  connecting  the 
eight  half-cystine  residues  ;  and,  three,  the  localization  of  the  'active  center 
of  ribonuclease,  if  indeed,  one  may  hope  that  such  a  unique  and  relatively 
small  catalytic  site  can  be  found  in  this  large  molecule.  The  results  of  all 
three  lines  of  investigation  undoubtedly  bear  on  one  another,  but  the  pre- 
sent discussion  will  be  limited  to  the  first  two  of  them. 

The  problem  posed  by  the  sequence  of  the  amino  acid  residues  in  ribonu- 
clease can  perhaps  best  be  considered  in  connection  with  Fig.  2,  which 
shows  the  sequences  as  they  are  known  at  the  present.  The  location  of  90 
of  the  1 24  residues  in  the  chain  has  so  far  been  established  (cf.  9),  and  these 
are  underlined  in  the  formula.  Work  is  currently  in  progress  to  fill  in  the 
remaining  gaps  in  the  sequence. 

In  the  sequence  work,  it  was  first  necessary  to  prepare  in  good  yield  a 
series  of  relatively  small  peptides  which  collectively  represented  all  of  the 
residues  in  the  chain  of  oxidized  ribonuclease.  The  partial  structural  formula 
was  of  great  value  in  pointing  the  way.  Because  this  formula  showed  the 
manner  in  which  the  forty-odd  peptides  formed  by  enzymatic  hydrolyses  are 
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linked,  detailed  structural  studies  of  all  of  these  peptides  were  not  required. 
It  was  possible  to  select  for  sequence  work  only  those  peptides  of  suitable 
size  which  do  not  overlap  one  another  (Fig.  1).  The  starting  point  was  the 
series  of  13  peptides  obtained  by  tryptic  hydrolysis  of  oxidized  ribonuclease. 
These  peptides  account  for  all  of  the  ribonuclease  molecule,  and  can  be 

Lys.Glu.Thr.Ala.Ala.Ala.Lys.Phf.Glu.Arg.Ser.Thi-.Ser.Ser.Asp.His.Met.Glu.Ala.Ala.  

© 

rNH2                    I       rNH2-NH2                                          i-NHg 
Ser.Ser.Asp.Ser.Tyr.Cys.Asp.Glu.Met.Met.Lys.Ser.Arg.Asp.Leu.Thr.Lys.Asp.Arg.  

©  @ 

I      rNH2                                               r-NH2                                    rNH2^                            i      pNH, 
(Cys.Asp,Val,Pro,Thr,Lys).Phe.(Glu,Val,Leu.Ser,His).(Asp,Glu,Ala,Val).(Cys,Glu,Ala,Val,Ser). 

®       "  ? 

pNH2  I  rNH2  r-NH2-NH2  rNH2 

I  I  I  I         I  I  I 

Lys.  Asp.  Val.  Ala. Cys.  Lys.  Asp. Gly.Thr.  Asp. Glu. Cys.Tyr. Glu. Ser.Tyr.Ser.Thr.  Met. Ser. lieu.  Thr. 

y                                                                                    [-NH2          I                                           pNH2-NH2 
Asp.Cys.Arg.Glu.Ser. Thr. Ser. Gly.  Lys. Tyr. Pro.  Asp.  Ala. Cys.Tyr. Lys. Thr. Thr.  Asp. Glu. Ala. Lys. 

@ 

I  n 

(Val.Ileu2,His).(Cys.Asp,Glu.Gly.Ala,Pro).Tyr.(Val2.Pro,His).Phe.Asp.Ala.Ser.Val 

Fig.  2.  Arrangement  of  the  amino  acid  residues  in  oxidized  ribonuclease.^  The  sequences 
determined  to  date  are  underlined.  The  residues  in  parentheses  are  of  undetermined  order. 
The  cysteic  acid  residues  have  been  numbered  consecutively,  I  to  VIII,  starting  from  the 
amino-end  of  the  chain. 

prepared  in  yields  ranging  from  50  to  100  per  cent.  Nine  of  these  13  pep- 
tides, varying  in  size  from  dipeptides  to  heptapeptides,  account  for  42  of 
the  124  residues  in  the  chain,  and  are  small  enough  to  be  studied  directly. 
The  structures  of  all  nine  have  been  worked  out.  The  remaining  four  pep- 
tides of  the  trypsin  series,  O-Tryp  4,  0-Tryp  9,  O-Tryp  2,  and  0-Tryp  16 
(Fig.  1),  which  contain  21,  22,  19,  and  20  residues  respectively,  account 
for  the  rest  of  the  molecule,  but  are  too  large  to  be  useful,  as  such,  for 
sequence  studies.  However,  from  the  knowledge  of  the  points  of  cleavage 
by  chymotrypsin  provided  by  the  partial  structural  formula,  it  can  be  pre- 
dicted that  each  of  these  four  large  peptides  should  be  split  individually 
with  chymotrypsin  into  fragments  of  a  more  manageable  size.  This  has  been 
carried  out,  and  in  each  instance  the  expected  segments  were  obtained.  For 
example,  the  action  of  chymotrypsin  on  O-Tryp  4,  which  contains  21  amino 
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acid  residues,  should  result  in  the  cleavage  of  the  bonds  indicated  by  the 
dashed  lines  with  the  formation  of  three  peptides.  Of  these,  one  should  be 
a  peptide  already  characterized  (O-Chy  11,  Fig.  1)  in  chymotryptic  hydro- 
lysates  of  oxidized  ribonuclease,  while  the  other  two  should  be  new  pep- 
tides with  amino  acid  compositions  in  agreement  with  those  predicted  by 
the  formula.  The  top  curve  in  Fig.  3  shows  the  separation  on  a  150  cm. 
column  of  Dowex  50-X2  of  the  three  peptides  formed  upon  chymotryptic 
hydrolysis  of  peptide  0-Tryp  4.  A  comparison  of  the  amino  acid  com- 
position of  these  peptides  (given  adjacent  to  each  of  the  peaks  on  the  curve) 
with  the  partial  structural  formula  in  Fig.  1  reveals  that  the  expected  frag- 
ments have  indeed  been  isolated. 

The  second  curve  in  Fig.  3  illustrates  the  corresponding  result  with  pep- 
tide 0-Tryp  9.  In  this  experiment,  the  yields  (shown  in  parentheses  after 
each  of  the  peaks  on  the  curve)  were  not  so  favorable,  because  additional 
cleavage  occurred  at  two  bonds  which  were  not  markedly  attacked  when 
lower  enzyme  concentrations  were  used  in  the  experiments  with  the  intact 
oxidized  protein.  Finally,  the  bottom  two  curves  in  Fig.  3  show  the  results 
obtained  upon  chymotryptic  hydrolysis  of  peptides  O-Tryp  2  and  O-Tryp  16. 
Once  again  the  products  predicted  from  the  partial  structural  formula  were 
formed.  These  experiments,  therefore,  not  only  furnished  peptides  suitable 
for  further  sequence  work,  but  the  results  also  provided  strong  substantia- 
tion for  the  deductions  that  led  to  the  derivation  of  the  partial  formula. 

Once  10  to  20  mg.  quantities  of  23  peptides  of  suitable  size  for  sequence 
work  had  been  prepared  from  the  entire  ribonuclease  molecule,  we  were 
confronted  with  a  more  familiar  type  of  problem,  namely,  that  of  deriving 
the  amino  acid  sequence  in  relatively  small  peptides.  We  thus  were  fortunate 
in  being  able  to  profit  from  the  experience  of  the  many  people  who  have 
been  through  this  territory  before  us.  A  variety  of  chemical  and  enzymatic 
methods  has  been  employed  for  the  determination  of  the  sequences  of  the 
amino  acid  residues  in  these  peptides.  The  chemical  methods  have  included 
the  DNP  procedure  of  Sanger  for  iV-terminal  residues,  hydrazinolysis  for 
the  detection  of  carboxyl-terminal  amino  acids  (by  the  Braunitzer  and 
Schramm^"  modification  of  the  Akabori  procedure),  and,  in  particular,  step- 
wise degradation  from  the  amino-end  by  the  phenyl-wo-thiocyanate  method 
of  Edman.^^  In  the  Edman  procedure,  each  step  of  the  degradation  has 
been  followed  by  quantitative  amino  acid  analysis  of  the  peptide  remain- 
ing after  removal  of  the  phenylthiohydantoin. 

In  our  hands,  the  most  effective  method  for  carrying  out  the  reaction  of 
phenyl-/5o-thiocyanate  with  the  free  amino  groups  of  peptides  has  been  the 
one  described  by  Ottesen  and  Wollenberger^^  in  their  work  on  the  peptides 
liberated  from  ovalbumin  during  the  formation  of  plakalbumin.  Routinely, 
the  reaction  is  carried  out  at  40°  in  30  per  cent  aqueous  dioxane,  and  the 
pH  is  maintained  at  8-0  with  the  aid  of  a  pH-stat.  In  this  way  the  reaction 
may  be  followed  quantitatively.  The  rate  of  the  reaction  is  a  function  of  the 


10- 


[A5p,Glu,Thr.  SePj.Met.  His]  (85^) 


(a) 

[Cy3,A5p-NHj,GlijMHj,Met2]ly5(90^; 

[A5pNH2.AlQj,5ePj] 
ill       Typ  (90%) 


100  300  350  •*» 

0.2N  pH  3.1,  35° 


[Cy5.Glu-NH2,AlQ,VaI,5ep]Ly5(80X)        [Cys.Asp  NHj,Vül,Tlir;Pro,Lys]Phe    ,,-. 


[A5p<fNHj),Glu2y(-NH2), 
>H  AlQ.VüljXeu.SenHis] 


(55%) 


15^ 


200      250      300      350      •«0     450      550      600 

I^GraduQlly  increoaing  pH  i  &<Q*J,(0.2N  pH 31  —IN  pH5.1),  35°- 


f;yaA5p,Ileu.Thn3ep]Arg(85%)     fQ\ 


[Cy5,(A5p-NH2)j,Glu-NHj,Gly,Thp]Typ 
(90%) 


^lü,5er]Typ 
(90%) 


JL 


[ThiiSeriMet] 


50  150  200  250  300         350  400  450 

I- — 0.2N  pH  31, 35° -l-Grad,  incr  pHï  [Nal(0ZNpH3HNpH51),35'"H 


f\3p,AlQ,VQl,5er]l007o 


[Cys,Aspj(-NH2),Glui/(-NH2),Gly,Ala.      (d) 
^1    Val,lleu2,Ppo,His]Typ  (90%) 


[Val2,7ro,His]?he(95%) 


.j:^ 


Effluent  ml    200  300     "400      700  600       '     1100  1200  1300 

1- — 0,2n  pH 31, 35°— KGrad  incr  pHx[Na'],{Q2NpH3i-2N pH5.1),35°H 


Fig.  3.  The  peptides  in  20-hour  chymotryptic  hydrolysates  of  peptides  O-Tryp  4,  O-Tryp 
9,  O-Tryp  2,  and  O-Tryp  16  obtained  from  performic  acid-oxidized  ribonuclease.  Chromato- 
graphy of  hydrolysates  from  approximately  5  micromoles  of  each  peptide  was  carried  out 
on  1 50  X  0-9  cm.  columns  of  Dowex  50-X2  in  the  sodium  form.  The  effluent  was  collected  in 
2  ml.  fractions.  Aliquots  (0-1  ml.)  were  removed  for  alkaline  hydrolysis  prior  to  analysis 
by  the  ninhydrin  method.  The  figures  in  parenthesis  represent  the  yield  of  each  peptide. 
The  sequence  of  the  amino  acids  within  the  parentheses  is  undetermined.  (From  Hirs, 
manuscript  in  preparation.) 

(a)  O-Tryp.  4.  Elution  of  the  column  was  with  0-2n  sodium  citrate  buffer  at  pH  3-1. 

(b)  O-Tryp  9.  Elution  was  with  0-2n  sodium  citrate  buffer  at  pH  3-1  as  far  as  150  effluent 
ml.  At  this  point,  the  pH  and  molarity  of  the  influent  buffer  were  gradually  increased  by 
the  use  of  a  mixing  chamber  of  610  ml.  volume  filled  with  the  0-2n  buffer  at  pH  3-1  into 
which  N  sodium  citrate-acetate  buffer  at  pH  5-1  was  allowed  to  flow. 

(c)  O-Tryp  2.  The  elution  procedure  followed  that  given  in  (b)  except  that  the  gradual 
change  of  pH  and  molarity  of  the  buffer  was  begun  after  220  effluent  ml. 

(d)  O-Tryp  16.  The  elution  procedure  followed  that  given  in  (b)  except  that  the  gradual 
change  of  pH  and  molarity  of  the  buffer  was  begun  after  400  effluent  ml.  After  610  effluent 
ml.,  the  reservoir  on  the  mixing  device  was  charged  with  2n  sodium  citrate-acetate  buffer 
atpH5-l. 
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Structure  of  the  peptide,  the  presence  of  a  large  R  group,  such  as  leucine, 
at  the  amino-end  being  inhibitory,  whereas  amino-terminal  glycine  reacts 
very  rapidly.  In  the  cyclization  of  the  phenylthiocarbamyl  peptides,  Edman's 
original  method  has  been  used,  the  reaction  being  carried  out  in  glacial 
acetic  acid  saturated  with  dry  hydrogen  chloride  at  100°.  Attempts  to  employ 
aqueous  acid  in  the  cyclization  step  have  met  with  varied  success.  With 
peptides  containing  several  residues  of  serine  or  threonine  not  immediately 
adjacent  to  the  amino-terminal  group,  the  use  of  dilute  aqueous  acid  led  to 
partial  hydrolysis  of  the  peptide.  Although  the  cyclization  reaction  in  glacial 
acetic  acid  does  not  always  proceed  as  rapidly  as  it  does  with  some  peptides 
in  aqueous  solution,  significant  partial  hydrolysis  in  the  non-aqueous  system 
has  not  yet  been  observed.  The  rate  of  the  cyclization  is  dependent  upon  the 
structure  of  the  peptide,  the  presence  of  an  aspartic,  glutamic,  or  cysteic 
acid  residue  at  the  amino-terminal  position  being  markedly  inhibitory. 
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Fig.  4.  Edman  degradation  of  Asp-NH2.Val.Ala.CySO3H.Lys  (0-Tryp  5).  The  chromato- 
grams  show  the  amino  acid  composition  of  acid  hydrolysates  of  the  peptides  present  at 
each  stage  of  the  procedure.  The  amino  acid  analyses  are  now  performed  on  columns  of 
sulfonated  polystyrene  resins  with  the  aid  of  automatic  recording  equipment.^^ 


In  Fig.  4  is  shown  an  example  of  how  the  Edman  procedure  has  been 
used  for  the  determination  of  the  sequence  of  the  pentapeptide  asparaginyl- 
valylalanylcysteicyllysine  (0-Tryp  5).  The  top  curve  in  Fig.  4  represents  a 
quantitative  amino  acid  analysis  of  the  acidic  and  neutral  amino  acids  in 
a  hydrolysate  of  the  pentapeptide.  A  resin,  Amberhte  IR-120,  similar  to 
Dowex  50-X8,  was  employed.  Analyses  such  as  these,  in  conjunction  with 
Edman  degradations,  are  currently  being  carried  out  with  automatic  recording 
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equipment  of  the  type  described  by  Spackman,  Stein,  and  Moore. ^^  The 
numbers  in  parentheses  adjacent  to  each  peak  are  the  molar  ratios  of 
each  component,  and  it  will  be  noted  that  the  hydrolysate  contains  nearly 
equimolar  quantities  of  cysteic  acid,  aspartic  acid,  alanine,  and  valine. 
Lysine  was  not  eluted  in  these  analyses  ;  however,  its  presence  at  the  carboxyl- 
terminal  position  could  be  inferred  from  the  specificity  of  trypsin,  and  was 
confirmed  by  other  means,  as  will  be  evident  presently.  After  the  first  appli- 
cation of  the  Edman  degradation,  carried  through  with  2-5  micromoles  of 
the  peptide,  the  quantitative  analysis  shown  on  the  second  line  of  Fig.  4 
was  obtained.  The  molar  ratios  of  the  components  are  still  unity,  with  the 
exception  of  aspartic  acid,  for  which  the  value  has  fallen  from  about  1  to 
0-23,  thus  revealing  that  aspartic  acid  or  asparagine  must  be  the  amino 
terminal  residue.  After  the  second  application  of  the  degradation,  the  ana- 
lysis on  the  third  line  of  Fig.  4  was  obtained,  which  showed  that  the  molar 
ratios  of  both  aspartic  acid  and  valine  are  now  low.  Valine  was  thus  re- 
vealed to  be  the  second  amino  acid,  while  the  third  amino  acid  was  shown 
to  be  alanine  as  a  result  of  a  further  degradation,  as  indicated  by  the  last 
curve  of  Fig.  4.  The  degradation,  when  extended  to  a  fourth  stage,  indi- 
cated a  loss  of  cysteic  acid,  which  established  the  sequence  aspartyl-  or  of 
asparaginylvalylalanylcysteicyllysine. 

The  yield  of  lysine  obtained  by  acid  hydrolysis  of  peptides  that  have 
undergone  reaction  with  phenyl-wo-thiocyanate  is  not  quantitative  and  falls 
off  unaccountably  as  the  number  of  steps  of  the  Edman  degradation  is 
increased.  For  this  reason,  analyses  for  lysine  have  not  been  included  in 
Fig.  4.  An  independent  demonstration  of  the  presence  of  lysine  at  the 
carboxyl-terminal  position  in  the  peptide  was  furnished  by  hydrazinolysis, 
which  gave  lysine  in  a  yield  of  70  per  cent  as  the  only  free  amino  acid.  The 
evidence  was  now  sufficient  to  reveal  completely  the  sequence  of  the  resi- 
dues in  the  peptide. 

The  final  question  was  whether  an  aspartyl  or  an  asparaginyl  residue  is 
present  at  the  amino-terminal  position.  Analysis  revealed,  that  on  acid 
hydrolysis,  one  equivalent  of  ammonia  was  released  per  molecule  of  pep- 
tide, indicating  that  the  amino-terminal  residue  is  probably  asparagine  and 
not  aspartic  acid.  This  was  independently  confirmed  when  it  was  found  that 
the  action  of  the  leucine  aminopeptidase  of  Spackman,  Smith,  and  Brown ^^ 
on  the  peptide  gave  asparagine,  identified  by  ion-exchange  chromatography, 
in  yields  of  27  and  91  per  cent  after  1  and  8  hours  respectively  of  hydrolysis 
with  the  enzyme.  In  addition,  two  further  peaks  were  present  on  the  effluent 
curve;  one  at  the  valine  position,  and  another  some  25  effluent  ml.  behind  it. 
They  presumably  corresponded  to  the  peptides  derived  from  the  starting 
material  by  the  successive  removal  of  asparagine  and  valine.  Similar  results 
have  been  obtained  with  some  of  the  other  peptides  examined  when  hydro- 
lysis with  either  leucine  aminopeptidase  or  carboxypeptidase  was  studied. 
In  addition  to  providing  conclusive  evidence  for  the  presence  of  the  amides 
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of  aspartic  and  glutamic  acid,  the  experiments  with  these  enzymes  have  in 
certain  instances  also  furnished  information  about  the  sequences  of  some 
of  the  residues  in  the  peptides.  The  interpretation  of  the  results,  however, 
is  not  always  clear-cut,  since,  as  was  noted  above,  partially  degraded  pep- 
tides may  move  on  the  Amberlite  IR-120  columns  at  rates  identical  with 
or  close  to  those  for  free  amino  acids. 

The  application  of  these  procedures  to  17  of  the  23  peptides  isolated  for 
sequence  studies  has  given  the  detailed  information  shown  in  Fig,  2,  which 
may  now  be  discussed  in  more  detail.  The  sequence  o'f  the  first  ten  residues 
in  the  peptide  chain  was  established  earlier;  the  first  four  as  a  result  of  Dr 
Anfinsen's  end-group  analyses  on  the  intact  protein*  and  the  rest  as  a  con- 
sequence of  our  analyses  of  the  peptides  formed  by  the  action  of  trypsin 
and  chymotrypsin.^-''  The  sequence  has  now  been  independently  confirmed 
by  the  use  of  carboxypeptidase,  and  by  step-wise  degradation.  More  recent 
work  has  revealed  the  sequence  from  the  11th  to  the  31st  residues  shown 
in  Fig.  2.  The  sequence  Ser.Arg.Asp(NH2).Leu.Thr.Lys.Asp.Arg  starting 
at  the  32nd  residue  was  previously  worked  out  by  Redfield  and  Anfinsen,^^ 
and  has  been  confirmed  in  the  present  work.  Thus  the  arrangement  of  the 
first  39  residues  in  the  chain  is  now  known.  The  portion  of  the  chain  from 
the  40th  to  the  61st  residue  is  accounted  for  by  the  tryptic  peptide  0-Tryp  9, 
containing  22  residues,  on  the  sequence  of  which  work  is  in  progress.  From 
the  sixth  lysine  residue  in  the  chain  (residue  61)  the  sequence  shown  in 
Fig.  2  passes  through  the  pentapeptide  0-Tryp  5,  which  was  considered  in 
some  detail  before,  and  continues  through  asparagine  and  glycine  for  a 
total  of  44  residues.  The  latter  is  the  longest  sequence  of  residues  worked 
out  to  date  along  the  peptide  chain  of  ribonuclease.  Finally,  the  work  of 
Dr  Anfinsen  and  his  colleagues ^^  has  estabhshed  the  sequence  of  the  four 
last  residues  in  the  chain. 

Concurrently  with  these  studies,  Dr  Spackman  in  our  laboratory, ^^  and 
Ryle  and  Anfinsen  at  Bethesda,^'  have  been  working  on  the  positions  of 
the  disulfide  bonds  in  ribonuclease.  The  approach  in  both  laboratories  is 
similar  in  several  respects  to  the  procedures  used  so  successfully  by  Ryle, 
Sanger,  Smith,  and  Kitai,^^  in  the  elucidation  of  the  disulfide  structure 
present  in  insulin. 

Ribonuclease  in  which  the  disulfide  bonds  are  intact  is  not  hydrolysed 
in  aqueous  solution  by  either  trypsin  or  chymotrypsin.  In  the  presence  of 
2  M  guanidinium  chloride,  however,  successive  hydrolysis  at  pH  7-5  by 
trypsin  and  chymotrypsin,  followed  by  desalting  and  chromatography  on 
Dowex  50-X2,  gave  the  result  shown  on  the  top  part  of  Fig.  5.  Enzymatic 
action  was  carried  out  in  the  presence  of  A^-ethylmaleimide  to  abolish  or 
at  least  minimize  the  disulfide  interchange  reaction.  The  cystine  containing 
peptides,  labeled  A  to  F  in  the  curve,  were  estimated  by  the  phosphotung- 
state  procedure  of  Kassell  and  Brand,  ^^  the  results  of  which  are  shown  by 
the  open  circles.  The  fractions  in  the  major  peaks  were  pooled,  desalted, 
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oxidized  with  performic  acid,  and  the  oxidation  mixture  was  chromato- 
graphed  as  before. 

The  results  from  Zone  C  of  the  top  chromatogram  are  shown  at  the 
bottom  of  Fig.  5.  The  cysteic  acid  peptides  formed  by  oxidation  move  well 
ahead  of  the  rest  of  the  mixture  originally  present  in  Zone  C,  and  which 
has  not  been  affected  by  oxidation.  The  fractions  containing  each  of  these 
two  fast-moving  peaks  were  pooled  and  their  amino  acid  composition  was 
determined  quantitatively.  The  empirical  formulae  for  the  pair  of  cysteic 
acid  peptides  is  shown  on  the  curve.  In  the  same  manner,  a  pair  of  cysteic 
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Fig.  5.  Determination  of  the  position  of  the  disulfide  bonds  in  ribonuclease.  Upper 
curve:  A  tryptic  and  chymotryptic  hydrolysate  from  200  mg.  of  ribonuclease  was  chromato- 
graphed.  Lower  curve:  Fractions  from  Zone  C  of  upper  curve  were  pooled,  desalted, 
oxidized,  and  chromatographed.  (From  experiments  of  Spackman;  cf.  ^^.) 

acid  peptides  of  the  following  composition  was  obtained  from  Zone  B: 
(Asp.NH2,Val,Ala,CyS03H,Lys)and(Asp.NH2,Thr,Asp.NH2,CyS03H,Glu. 
NH2,Gly,Tyr).  From  the  amino  acid  composition  of  these  two  pairs  of  pep- 
tides alone,  without  the  necessity  for  any  sequence  work,  it  could  be  con- 
cluded that  a  disulfide  bond  links  half-cystine  residues  I  and  11,  and  another 
links  half-cystine  residues  IV  and  V. 

There  is  room  for  improvement  in  the  results  thus  far,  for  the  yields  of 
the  peptides  referred  to  are  low,  and  sufficient  quantities  of  the  peptides 
corresponding  to  the  other  two  -S-S-  bonds  presumably  present  in  the 
molecule  have  not  yet  been  isolated,  although  work  in  this  direction  is  in 
progress. 

With  the  generous  permission  of  Dr  Anfinsen,  however,  Dr  Spackman's 
results  can  be  considered  in  conjunction  with  those  obtained  by  Ryle  and 
Anfinsen^'  on  the  same  subject.  By  the  use  of  quite  different  methods,  they 
have  (see  p.  229)  obtained  evidence  for  linkages  between  half-cystine  resi- 
dues II  and  VIII,  and  III  and  VII,  as  well  as  evidence  bearing  on  the  prob- 
able correctness  of  the  I- VI  and  IV-V  linkages  which  we  have  found.  Through 
Dr  Anfinsen's  courtesy  in  allowing  us  to  refer  to  his  results,  it  is  possible  to 
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visualize  the  ribonuclease  molecule  in  the  two-dimensional  schematic  repre- 
sentation shown  in  Fig.  6.  It  is  quite  clear  that  the  protein  must  exist  in  a 
tightly  wound  folded  form.  It  should  be  noted  that  the  small  disulfide  ring 
around  the  IV-V  disulfide  bond,  is  two  residues  larger  than  the  small  ring 
already  found  in  insulin  by  Sanger  and  in  vasopressin  and  oxytocin  by 
du  Vigneaud  and  his  colleagues. 

The  information  contained  in  Figs.  2  and  6  furnishes  as  complete  a  pic- 
ture of  the  covalent  structure  of  the  ribonuclease  molecule  as  can  be  drawn 
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Fig.  6.  A  two-dimensional  diagram  indicating  the  positions  of  the  four  disulfide  bonds 
in  native  ribonuclease.  (Based  upon  the  results  of  Spademan  et  al.^^  and  Ryle  and  Anfinsen.^^  ) 

now.  The  studies  outlined  in  this  communication  encourage  the  hope,  how- 
ever, that  the  complete  elucidation  of  the  structure  is  not  too  far  off.  Although 
this  appears  to  be  a  straightforward  task,  the  protein  chemist  must  always 
be  on  the  look-out  for  unsuspected  types  of  linkages  in  proteins.  Such 
linkages  have  been  found  in  bacitracin  by  Craig  and  his  colleages,^"  and 
by  Newton  and  Abraham, ^^  and  recently  in  collagen  by  Mechanic  and 
Levy. 22  It  is  becoming  increasingly  clear  also  that  even  a  complete  know- 
ledge of  all  of  the  covalent  linkages  in  ribonuclease  will  almost  certainly 
not  mean  that  the  chemical,  enzymatic,  and  physical  properties  of  the  pro- 
tein will  automatically  be  understood.  Indeed,  it  appears  as  though  detailed 
structural  knowledge  frequently  does  not  actually  solve  problems,  but  rather 
permits  the  problems  to  be  rephrased  in  different,  albeit  more  precise,  terms. 
Possibly  it  is  one  of  the  functions  of  a  symposium  such  as  this  to  draw 
attention  to  those  areas  of  protein  structure  that  lie  beyond  the  detailed 
formulae  that  we  all  are  so  happily  constructing  at  the  present  time. 
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The  structure  of  ribonuclease  in  relation  to  its 
enzymatic  activity  and  physical  properties* 

C.B.  ANFINSEN 

Laboratory  of  Cellular  Physiology  and  Metabolism, 
National  Heart  Institute,  Bethesda,  Maryland 


In  order  to  appreciate  the  present  state  of  protein  chemistry  we  need  only 
consider  the  material  being  covered  in  this  symposium  from  the  standpoint 
of  our  position  ten  years  ago.  Although  there  was  available  at  that  time  a 
large  amount  of  physical  data  relating  to  the  behavior  of  proteins  in  solu- 
tion and  in  the  solid  state,  a  rational  interpretation  of  these  data  was  essen- 
tially impossible  because  of  the  almost  complete  absence  of  quantitative 
information  on  the  sequential  structure  of  these  molecules.  Today,  the 
determination  of  the  amino  acid  sequence  of  polypeptide  chains  has,  in 
general,  become  a  technical  problem  of  only  moderate  difficulty  and  the 
application  of  available  degradative  methods  is  likely  to  permit  the  eluci- 
dation of  the  covalent  structure  of  almost  any  chosen  protein  whose  mole- 
cular weight  lies  within  reasonable  limits. 

Drs  Hirs,  Moore  and  Stein  have  presented  to  you  a  summary  of  their 
precise  studies  on  the  structure  of  ribonuclease  (see  p.  211).  Detailed  refer- 
ence to  their  original  papers  has  not  been  made  in  the  bibliography  below 
but  can  be  found  elsewhere  in  this  volume.  As  they  have  pointed  out,  we 
have  also  been  belabouring  this  protein  in  Bethesda  and  have  reached  con- 
clusions about  the  sequence  of  the  molecule  which  are  in  good  agreement 
with  their  findings.  In  view  of  this  agreement  I  will  only  discuss  some  of 
the  pertinent  results  which  begin  to  enable  us  to  correlate  the  activity  and 
physical  behavior  of  this  enzyme  with  its  covalent  and  non-covalent  structure. 

Before  beginning  these  considerations  I  would  like  to  refer  briefly  to  two 
methodological  developments  which  may  be  of  interest  to  this  audience. 
These  methods  have  to  do  with  the  general  problem  of  the  specific  hydro- 
lysis of  a  protein  molecule  to  yield  reproducible  fragments  of  moderate  size, 
a  problem  which  we  all  recognize  as  fundamental  in  structural  analysis. 

*  Those  recent  studies  discussed  in  this  chapter,  which  were  carried  out  at  Bethesda, 
are  the  combined  efforts  of  Drs  Michael  Sela,  W.  F.  Harrington,  F.  W.  White,  and 
myself.  I  would  like  to  emphasize  my  role  as  spokesman  for  this  group. 
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The  techniques  of  hydrolysis  just  described  by  Drs  Hirs,  Moore  and  Stein 
may  be  difficult  to  apply  in  the  case  of  polypeptide  chains  longer  than  that 
of  ribonuclease  since  the  number  of  overlapping  peptide  fragments  to  be 
considered  might  become  forbiddingly  large.  We  have  recently  described  a 
method  by  which  the  action  of  trypsin  may  be  restricted  to  those  peptide 
bonds  in  which  arginine  constitutes  the  /^-terminal  side.^  This  procedure 
leads  to  the  formation  of  peptide  fragments,  one  more  in  number  than  the 
number  of  arginine  residues  in  the  chain.  The  reactions  involved  are  sum- 
marized in  the  following  equation  : 


Trypsin 


HBr 


Cbzo  Cbzo  Cbzo 

I                          I  I 

(A^-terminal) Lys Arg Lys ....  (C-terminal)    - 

(residue)  (residue) 

Cbzo  Cbzo  Cbzo 

I  I  I 

iV-term Lys Arg+NHoR Lys (C-term.)   > 

HCOOH 
benzyl  bromide +C02+unsubstituted  peptide  fragments  (as  HBr  salts) 
(Cbzo  -  CgHs .  CH2 .  OC— O— ) 

As  indicated,  carbobenzoxylation  of  the  e-amino  groups  of  lysine  prevents 
the  hydrolysis  of  lysyl  bonds  by  trypsin.  Following  digestion,  the  carbo- 
benzoxy  groups  are  removed  to  yield  a  mixture  of  peptides,  which  may  be 
separated  by  one  of  a  variety  of  methods,  for  subsequent  degradative  ex- 
periments. 

It  has  been  the  common  experience  of  many  investigators  that  the  trypsin 
digestion  of  native  proteins  proceeds  very  slowly  due,  presumably,  to  the 
steric  hindrances  introduced  by  the  presence  of  disulfide  bridges.  Hitherto, 
the  rupture  of  these  bridges  has  been  generally  accomplished  by  performic 
acid  oxidation  which,  unfortunately,  causes  extensive  destruction  of  trypto- 
phan residues.  The  full  usefulness  of  the  carbobenzoxylation  method  de- 
pends, therefore,  on  the  use  of  techniques  for  the  cleavage  of  S-S  bridges 
under  conditions  which  are  not  likely  to  damage  portions  of  the  sequence 
containing  tryptophan.  As  an  approach  to  this  problem,  the  disulfide  bridges 
of  ribonuclease  (which  contains  no  tryptophan)  and  of  lysozyme  (containing 
6-7  residues  of  tryptophan)  have  been  reductively  cleaved  with  thioglycollic 
acid  in  8m  urea  and  the  resulting  sulfhydryl  groups  have  been  converted 
to  their  »S-carboxymethyl  derivatives  by  treatment  with  iodoacetic  acid,^ 
as  shown  in  the  equations  opposite. 

Fraenkel-Conrat  et  al.  have  previously  described  a  reductive  cleavage  of 
lysozyme  where  SH  groups  were  masked  by  treatment  with  iodoacetamide.^ 
The  use  of  iodoacetic  acid,  however,  appears  to  have  a  distinct  advantage 
since  the  final  carboxymethylated  derivative  is  quite  soluble  over  a  wide 
range  of  pH  values  in  contrast  to  the  amide  form. 
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The  quantitative  character  of  the  reduction  and  alkylation  procedures  has 
been  confirmed  by  titration  of  the  reduced  proteins  with  /?-chloromercuri- 
benzoate  and  by  the  isolation  and  determination  of  A^-dinitrophenyl-^"- 
carboxymethyl-L-cysteine  from  dinitrophenylated  hydrolysates  of  the  alkylated 
derivatives.  2  Table  1  shows  data  comparing  the  determination  of  SH  groups 
by  the  two  methods  in  ribonuclease  at  different  stages  of  reduction.  As  can 

Table  1 

ANALYSIS    FOR    CYSTEINE    IN    RIBONUCLEASE    AT    VARIOUS 

STAGES    OF    REDUCTION,    BY    /7-CHLOROMERCURIBENZOATE 

(PCMB)    TITRATION*    AND    BY   ESTIMATION    OF 

A^-DINITROPHENYL-5-CARBOXYMETHYL-CYSTEINE 


SH  groups  by 

DNP-S-carboxy- 

PCMB  titration 

methyl-cysteine* 

4-2 

5-1 

5-2 

5-3 

60 

5-2 

7-5 

7-8 

80 

8-7 

8-1 

8-4 

8-2 

8-3 

*  Determined  in  dinitrophenylated  hydrolysates  of  reduced,  alkylated  protein.  Cor- 
rected for  losses  due  to  hydrolysis  and  chromatography  essentially  as  described  by  Levy.^ 
Destruction  of  carboxymethylcysteine  during  acid  hydrolysis  is  on  the  order  of  5  %  or 
less  (16  hours  in  constant-boiling  HCl  at  110°C.). 

Pps 
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be  seen  here,  the  chromatographic  separation  of  A^-dinitrophenyl-^-carboxy- 
methylcysteine  and  its  subsequent  elution  and  estimation  compares  well  with 
the  spectrophotometric  method.  Table  2  deals  with  similar  experiments  on 

Table  2 

ESTIMATION  OF  CYSTEINE,  ASPARTIC  ACID, 
AND  GLUTAMIC  ACID  CONTENT  OF  FULLY 
REDUCED,  CARBOXYMETHYLATED  LYSOZYME 


Theoretical* 

Found Î 
Exp.  1         Exp.  2 

DNP-Asp 
DNP-Glu 
DNP-SCMC§ 

bis-DNP-cystine 

20 

4-5t 

10 

0 

20 

4-8 
10-6 

00 

20 

61 
100 

0-2 

The  reduced,  alkylated  protein  was  hydrolyzed  at  110°  for  16  hours  in  6n-HC1.  The 
hydrolysate  was  dinitrophenylated^  and,  following  chromatography  according  to  Levy,^ 
the  spot  containing  the  DNP  derivatives  of  glutamic  acid,  aspartic  acid,  and  S-carboxy- 
methyl-cysteine  was  excized.  The  mixture  of  derivatives  was  separated  in  the  tertiary  amyl 
alcohol  system  of  Blackburn  and  Lowther,^  running  the  chromatograms  in  the  descend- 
ing direction  for  48  hours. 

lysozyme  and  indicates  that  the  application  of  the  reductive  and  alkylating 
techniques  to  this  tryptophan-rich  protein  furnishes  a  convenient  method 
both  for  opening  the  cross-linked  chain  and  for  the  subsequent  determina- 
tion of  the  cystine  content  of  the  protein. 

It  has  also  been  found  that,  under  the  conditions  of  reduction  and  alkyla- 
tion  employed,  histidine,  tryptophan,  and  tyrosine  residues  in  the  polypeptide 
chain  are  unchanged.  Michaelis  and  Schubert,  in  1936,'^  demonstrated  that 
only  SH  groups  are  alkylated  by  haloacetic  acids  under  sufficiently  mild 
conditions  (room  temperature  and  pH  values  in  the  neighbourhood  of  8). 
It  has  recently  been  reported  by  Korman  and  Clarke  that  more  vigorous 
conditions  (40°,  pH  9,  4-5  hours)  are  required  for  the  alkylation  of  free 
amino  groups  and  that  the  ring  nitrogen  atoms  and  the  phenolic  hydroxyl 
group  of  tyrosine  are  even  more  resistant  to  reaction.^  We  have  further 
examined  these  possible  side  reactions  under  the  conditions  employed  by  us 
(room  temperature,  pH  8,  2  hours)  and  find  that  less  than  5%  of  the  histidine 
and  tyrosine  residues  and  free  amino  groups  become  carboxymethylated 
under  conditions  where  reaction  of  SH  groups  is  complete, 

*  The  'best'  values  as  mentioned  by  Thompson.^ 

t  The  value  for  glutamic  acid  is  undecided.  Thompson's  value  of  7,  deduced  from 
sequence  studies,  is  probably  high.  Lewis  et  al.^^  give  4,  as  do  Dose  and  Caputo,^^  and 
Thompson^  gives  5-1  on  the  basis  of  amino  acid  analysis. 

X  Calculated  assuming  the  presence  of  20  residues  of  aspartic  acid  in  the  protein. 

§  A^-dinitrophenyl-5-carboxymethylcysteine. 
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When  fully  reduced,  alkylated  ribonuclease  was  carbobenzoxylated  and 
then  subjected  to  trypsin  digestion,  results  were  obtained  that  completely 
paralleled  those  obtained  with  carbobenzoxylated,  oxidized  ribonuclease. 
End-group  analysis  of  trypsin  digests  of  these  two  forms  of  the  extended 
ribonuclease  chain  are  shown  in  Table  3.  In  both  cases,  the  peptide  bond 

Table  3 

N-TERMINAL    AMINO    ACID    ANALYSES    OF    TRYPTIC    DIGESTS 

OF  CARBOBENZOXYLATED  OXIDIZED   RIBONUCLEASE  AND  OF 

REDUCED    AND    CARBOXYMETHYLATED    RIBONUCLEASE 

The  carbobenzoxy  groups  were  subsequently  removed  with  anhydrous  hydrogen  bromide. 


Oxidized  ribonuclease 

Reduced  and  carboxymethylated 
ribonuclease 

Found 

Theor. 

Found 

Theor. 

DNP-Asp 

DNP-Glu 

DNP-CySOgH 

e-DNP-Lys 

bis-DNP-Lys 

10 
1-8 
0-7 
100 
0-9 

10 
20 
10 
90 
10 

DNP-Asp 

DNP-Glu 

DNP-SCMC* 

e-DNP-Lys 

bis-DNP-Lyst 

10 
1-8 
0-8 
90 
0-4 

10 
20 
10 
90 
10 

which  joins  arginine  to  a  half-cystine  residue  in  the  native  protein  was  the 
most  resistant  to  digestion,  although  in  one  case  the  half-cystine  residue  was 
in  the  form  of  cysteic  acid  and  in  the  other  was  present  as  ^-carboxymethyl- 
cysteine. 

Data  on  lysozyme  are  presented  in  Table  4.  The  reduced,  alkylated  pro- 
tein was  treated  with  trypsin  in  this  case  without  prior  carbobenzoxylation. 
Lysozyme  contains  eleven  arginine  and  six  lysine  residues  and  should  there- 
fore yield  eighteen  peptides  upon  trypsin  digestion,  if  all  the  lysyl  and 
arginyl  bonds  are  intrinsically  susceptible  to  trypsin  action  (i.e.  not  adjacent 
to  proline,  etc.).  Electrophoretic  patterns^^  of  such  a  digest  are  shown  in 
Figs.  1a  and  1b.  Fig.  1a  shows  the  pattern  obtained  in  1-5  hours  at  35v/cm. 
(pH  6-5,  pyridine,  acetate  buffer),  and  Fig.  1b  in  2-5  hours  under  the  same 
conditions.  The  end-group  data  in  Table  4  show  that  approximately  fifteen 
peptide  bond  cleavages  are  detectable  after  digestion  and  that  these  may, 
in  most  instances,  be  accounted  for  in  the  lysyl  and  arginyl  sequences  deter- 
mined to  be  present  in  lysozyme  by  Thompson,^  by  Schroeder,^^  and  by 
Acher,  et  al.^^  It  is  worthwhile  to  point  out  that  the  time  required  for  com- 
plete cleavage  of  the  reduced,  fully  alkylated  lysozyme  chain  was  less  than 
five  minutes  at  37°,  employing  trypsin  at  a  level  of  1  %  that  of  the  substrate. 

*  A^-dinitrophenyl-iS-carboxymethylcysteine. 

t  The  values  for  bis-DNP-lysine,  both  in  lysozyme  and  in  ribonuclease,  are  sometimes 
unaccountably  low. 
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DETERMINATION   OF   A^-TERMINAL    AMINO    ACIDS   IN   A   TRYPSIN 
DIGEST  OF   FULLY   REDUCED,   CARBOXYMETHYLATED    LYSOZYME 

DIGESTION     conditions:      1%    LYSOZYME,    0-01%     TRYPSIN,    pH    8.0,    25°. 

PROTEOLYSIS    COMPLETE    (BY    pHSTAT    MEASUREMENT)    AFTER    5     MINUTES. 

END-GROUP    ANALYSIS    BY    METHOD    OF    LEVY^ 


Amino  acid 

Total  in  lysozyme 

End-groups  expected 
from  sequence  data* 

End-groups 
found 

Aspartic  Acid                    \ 

^  ]  Total 

Glutamic  acid 

35 

1      4 

4-0 

S'-carboxymethyl  cysteine  j 

3) 

Serine 

9 

— 

0-7 

Threonine 

7 

— 

M 

Glycine 

11-12 

3 

1-9 

Alanine 

10-12 

— 

0-3  (?) 

Valine 

5-6 

1 

1-7 

Leucine  (+isoleucine) 

8 

2 

2-8 

Phenylalanine 

3 

1 

M 

Lysine  (N-terminal  in  lyso- 

zyme) 
Total 

1 

1 

10 

12 

ca.  15t 

In  concluding  these  preliminary  remarks  about  methods  I  should  em- 
phasize that  a  real  measure  of  the  value  of  the  reductive  cleavage  of  disulfide 
bridges  and  of  the  trypsin  digestion  of  carbobenzoxylated  polypeptide  chains 
will  become  apparent  only  when  these  methods  have  been  properly  applied 
to  tryptophan-containing  proteins  larger  than  ribonuclease  and  lysozyme. 

To  proceed  with  the  main  purpose  of  this  discussion,  as  indicated  in  its 
title,  I  should  like  now  to  consider  some  aspects  of  ribonuclease  structure 
in  relation  to  biological  and  physical  properties.  As  you  have  seen,  the 
sequential  work  on  this  protein  is  well  along  and  the  full  sequence  should 
be  forthcoming  in  a  relatively  short  time.  As  all  of  us  will  agree,  however, 
a  knowledge  of  amino  acid  sequence  alone  is  of  little  use  in  the  understand- 
ing of  three-dimensional  structure  and  of  the  basic  reasons  for  biological 
activity.  It  has  become  increasingly  evident  that  the  activity  of  an  enzyme, 
for  example,  must  ultimately  be  explained  in  terms  of  the  secondary  and 
tertiary  structure  of  the  molecule,  determined  and  rigidly  fixed  by  covalent 
cross  linkages  and  by  non-covalent  bonds  of  differing  strengths. 

The  Rockefeller  Institute  group,^'  and  ourselves,^*  have  recently  under- 
taken independent  investigations  of  the  disulfide  bridges  of  ribonuclease  by 

*  From  sequence  data  of  Thompson,^  W.  Schroeder.^^  and  R.  Acher  et  al.^^  Amino 
acids  listed  are  those  following  either  lysine  or  arginine  in  the  sequences  hsted  in  the 
above  papers. 

t  Lysozyme  contains  6  lysine  and  1 1  arginine  residues.  Theoretical  maximum  number 
of  end-groups  in  a  complete  trypsin  digest  should,  therefore,  be  18. 


+ 


+ 


+ 


A  B 

Fig.  1  Fig.  2 

Fig.  1.  Electrophoretic  patterns^-  of  trypsin  digests  of  reduced,  alkylated  lysozyme:  (A), 

1-5  hours,  pH  6-5  pyridine-acetate  buffer,  35  V/cm.;  (B),  2-5  hours,  same  conditions. 
Fig.  2.  Electrophoretic  patterns^-  of  subtilisin  digest  of  native  ribonuclease.^^  Conditions 

as  in  Fig.  \A. 
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methods  essentially  the  same  as  those  employed  by  Sanger  and  his  col- 
laborators for  insulin. ^^  In  our  studies,  subtilisin  digests  of  native  ribonu- 
clease  were  subjected  to  electrophoresis  by  the  method  of  Michl^^  and  the 
five  ninhydrin-positive  bands,  which  also  exhibited  a  positive  reaction  for 
disulfide  bonds,^^  were  eluted.  A  typical  ninhydrin  pattern  is  given  in  Fig.  2. 
One  of  these  cystine-containing  components  proved  to  be  too  large  and  com- 
plicated for  direct  study.  The  other  four  were  oxidized  with  performic  acid  and 
the  resulting  cysteic  acid-containing  peptides  were  separated  and  analysed  for 
amino  acids.  The  results  of  these  analyses  are  given  in  Table  5.  A  considera- 
tion of  these  analyses,  taken  together  with  the  available  data  on  the  amino 

Table  5 
THE    LOCATION    OF    DISULFIDE    BRIDGES    IN    RIBONUCLEASE 


Disulfide- 

positive 

Band 

Composition  of  cysteic-acid  containing 
peptides  produced  by  oxidation 

Half-cystine 

residues 

implicated 

Disulfide 
bridge 

A 

C 
D 

E 

(l)CyS03-Asp-Arg-Glu-(Ser2,Thr,Gly) 
(2)  (CyS03,Asp,Glu) 

(I)  (Ser,GIu,CyS03)-Lys 
(2)(GIy,Asp,CyS03,Pro,Ala,Lys,Tyr) 
(1)  CyS03-Z-Ala,Tyr  7 
(2)Asp-Arg-CyS03-(Pro,Lys,Asp,Val,Thr)-Phe 

(1)  (GIy,Asp,CyS03,Pro,Ala,Lys,Tyr,Ser) 

(2)  (Seri.i,Glu2.o,Asp2.7,Lysi.8,Tyro.9, 
CySOa  3.2,GIyo.7,Thro.6,Alai.o,Vali.o 

No.  VI        1 

Nos.  I,  III,     \ 

V  and  VIII    J 

No.  Ill       \ 

No.  VII       J 

No.  VIII      \ 

No.  II        J 

No.  VII       1 

Nos.  Ill,  IV     1 

and  V         ^ 

together       J 

Nos.  I-VI 

Nos.  III- 

VII 

Nos.  II- 

VIII 

Nos.  IV- 
V 

acid  sequence  and  distribution  along  the  polypeptide  chain  of  ribonuclease, 
permits  the  conclusion  that  the  half-cystine  residues  of  the  protein  are  paired 
as  follows:  No.  I-No.  VI,  No.  II-No.  VIII,  No.  III-No.  VII,  and  No.  IV- 
No.  V  (numbering  the  half-cystines  from  I  to  VIII,  beginning  at  the  A^- 
terminal  end  of  the  protein).  Drs  Spackman,  Stein  and  Moore^^  have 
independently  identified  the  first  and  fourth  of  these  bridges  as  well  and 
are  now  in  the  process  of  examining  the  remaining  two.  Their  studies  are 
of  particular  importance  since  complete  agreement  in  the  data  from  the  two 
laboratories  will  help  to  rule  out  the  ever-present  possibility  that  there  might 
have  occurred  some  disulfide  exchange  or  other  rearrangement  during  the 
course  of  experimental  study.  The  assignment  of  the  disulfide  bridges  to- 
gether with  the  sequential  information  we  now  have  enables  us  to  construct 
the  schematic  and  provisional  diagram  shown  in  Fig.  3.  In  this  figure  the 
heavily-encircled  residues  are  arranged  in  an  order  which  is  known  (or 
nearly  so — there  exists  an  unresolved  disagreement  in  the  sequence  adjacent 
to  half-cystine  No.  VI.  This  portion  of  the  sequence  should  probably  read, 


230  C.  B.  ANFINSEN  [13 

Asp(NH2)-CyS03-Arg,  rather  than  CyS03-Asp(NH2)-Arg.  The  former  has 
been  deduced  by  Dr  Hirs  using  the  Edman  degradative  method,  while 
the  latter  is  from  our  less  rigorous  studies  using  partial  acid  hydrolysis.)  The 
geometrical  restrictions  introduced  into  the  ribonuclease  molecule  by  the 
disulfide  bridges  are  severe  and  suggest  that  the  marked  changes ^^  that 
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Fig.  3.  Schematic  diagram  of  ribonuclease  structure.  An  represents  asparagine;  Gn 
glutamine.  The  location  of  amide  groups  is  possible  from  the  work  of  Hirs,  Moore,  and 
Stein.  (See  p.  218.) 

occur  when  the  protein  is  dissolved  under  'denaturing'  conditions  such  as 
8m  urea  are  due  to  the  unfolding  of  the  uncrosslinked  'tails'  and  perhaps 
to  partial  expansion  of  coiled  regions  between  S-S  bridges. 

I  would  like  now  to  outline,  in  a  brief  way,  the  results  of  studies  on  the 
relationships  between  the  structure  and  function  of  ribonuclease. 


1.   REDUCTIVE  CLEAVAGE  OF    DISULFIDE  BRIDGES^ 

As  discussed  above,  thioglycollic  acid  causes  a  rapid  reduction  of  disulfide 
bridges  in  ribonuclease  in  the  presence  of  8m  urea  at  pH  values  in  the  neigh- 
borhood of  8.  In  the  absence  of  urea,  however,  only  partial  reduction  occurs, 
and  in  a  way  which  suggests  that  two  of  the  four  bridges  in  the  native  pro- 
tein are  particularly  susceptible  (Fig.  4).  During  the  reaction  the  activity  of 
the  enzyme  decreases  in  a  manner  (Fig.  5)  which  indicates  that  certainly 
one,  and  perhaps  two,  of  the  bridges  are  not  essential  for  catalysis.  »S-Carb- 
oxymethylation  of  the  SH  groups  does  not  affect  the  enzyme  activity  of  the 
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intermediates.  Regeneration  of  enzymatic  activity  takes  place  upon  reoxida- 
tion  of  the  sulfhydryl  groups  with  molecular  oxygen  and  'fingerprints'  (filter 
paper  electrophoretic  patterns  of  subtilisin  digests)  of  the  reaction  mixture 
after  reoxidation  (Fig.  6)  suggest  that  there  occurs  a  partial  reformation  of 
the  same  disulfide  bonds  as  those  occurring  in  the  native  molecule.  ^^  Specific 
identification  of  the  labile,  presumably  non-essential,  disulfide  bonds  has 
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Fig.  4.  The  kinetics  of  reduction  of  the  disulfide  bridges  of  ribonuclease  in  the  presence 
and  absence  of  8  m  urea.^^ 


SH    GROUP 


Fig.  5.  Activity  of  ribonuclease  at  various  stages  of  reduction  (expressed  as  percentage 
of  the  specific  activity  of  native  ribonuclease)  as  a  function  of  the  number  of  moles  of 
sulfhydryl  per  mole  of  enzyme.  A,  Reduction  in  absence  of  urea;  ^,  reduction  in  8m  urea; 
□  ,  reoxidation  of  fully  reduced,  inactive  ribonuclease;  0>  reoxidation  of  samples  con- 
taining more  than  six  sulfhydryl  groups  per  average  molecule;  V.  reoxidation  of  samples 
containing  about  four  sulfhydryl  groups  per  average  molecule.^ 
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not  yet  been  achieved.  However,  it  is  of  interest  that  the  active,  partially 
reduced  molecules,  free  of  detectable  amounts  of  native  enzyme,  are  much 
more  susceptible  to  proteolytic  digestion  (Fig.  7)  than  the  native  protein 
and  that  enzyme  activity  persists  even  after  considerable  further  degrada- 
tion with  subtiHsin.^^  This  latter  observation  is  promising  as  regards  the 
production  of  a  relatively  small,  catalytically  active,  'core'. 
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Fig.  6.  Drawing  of  the  disulfide-positive  bands  of  the  electrophoretic  patterns  obtained 
on  subtilisin  digests  of  native,  reduced,  and  reoxidized  ribonuclease.  Same  conditions  as 
in  Fig.  1a.  The  sample  which  was  reoxidized  with  molecular  oxygen  contained  8-0  SH 
groups  and  was  completely  inactive  before  oxidation.^* 
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Fig.  7.  Digestion  of  ribonuclease,  at  various  stages  of  reduction,  with  subtilisin  and 
tiypsin.  The  reduced  derivatives  were  carboxymethylated,  before  digestion.  Alkali  uptake 
was  followed  in  the  Coleman  autotitrator  at  pH  8.  Temperature,  37°,  except  where  indi- 
cated. Substrate  concentration,  10  mg/ml.  Proteolytic  enzyme  concentration,  1  %  of  this 
level.^^ 
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2.    PROTEOLYTIC    DIGESTION 

As  shown  by  Richards^^  limited  subtilisin  digestion  of  ribonuclease  causes 
the  cleavage  of  what  appears  to  be  a  single  peptide  bond  located  within 
twenty  or  so  residues  of  the  A^-terminal  end  of  the  polypeptide  chain. ^^  The 
resulting  macromolecular  derivative  has  full  enzymatic  activity.  Degradation 
of  the  C-terminal  end  of  ribonuclease  with  carboxypeptidase  pre-treated  with 
diisopropyl  fluorphosphonate  (DIP)  removes  the  C-terminal  valine  residue 
(and  a  variable  amount  of  the  two  preceding  residues,  serine  and  alanine) 
without  loss  of  activity. 2^  Non-DIP-treated  carboxypeptidase  effects  a  con- 
siderably greater  number  of  cleavages  and  still  leaves  most  of  the  activity 
even  after  fairly  extensive  attack. ^^ 

Limited  pepsin  digestion  at  pH  1  -8,  on  the  other  hand,  causes  the  com- 
plete inactivation  of  ribonuclease  at  a  rate  parallel  to  the  rate  of  release 
of  the  C-terminal  tetrapeptide  sequence. ^^  No  indications  of  rupture  of 
other  peptide  bonds  have  been  found  by  a  variety  of  chemical  and  physical 
methods. 2*  The  inactive  derivative  contains  a  C-terminal  phenylalanine  resi- 
due as  would  be  expected  from  the  sequence  data  shown  in  Fig.  3  above. 

3.    CORRELATIONS    BETWEEN    SPECIFIC    ASPECTS 
OF    TERTIARY    STRUCTURE    AND    ENZYMATIC 

ACTIVITY 

(a)  Urea.  In  an  earlier  paper  it  was  suggested  that  enzymatic  activity  in  this 
protein  is  not  dependent  upon  an  intact  system  of  hydrogen  bonds  since 
activity  is  undiminished  in  8m  urea  solutions. ^^  Our  more  recent  studies 
indicate  that  this  conclusion  is  probably  not  quantitatively  valid.  This  re- 
appraisal is  necessary  in  view  of  the  results  of  experiments  in  which  the 
marked  shift  in  the  spectrum  of  ribonuclease  to  lower  wavelengths  in  8m 
urea  solutions  has  been  studied  as  a  function  of  the  concentration  of  certain 
polyvalent  anions  and  polyanions.^^-^^  The  curves  in  Fig.  8,  for  example, 
indicate  the  differences  in  molar  extinction,  between  native  ribonuclease  in 
0-1  M-KCl  and  in  8m  urea  (curve  B),  and  in  8m  urea  containing  0-003m 
orthophosphate  ions  (curve  C).  Low  levels  of  orthophosphate,  arsenate, 
polymetaphosphate,  and  uridylate  (but  not  citrate)  ions  cause  a  complete 
reversion  of  the  shifted  spectrum  to  that  exhibited  by  the  native  enzyme 
in  the  absence  of  urea.  This  reversion  is  quantitative  at  concentrations  of 
ion,  as  low  as  0'05m  (i.e.,  about  60-70  atoms  of  phosphorous/mole  of  ribonu- 
clease). A  summary  of  these,  and  related  experiments  on  ultraviolet  absorp- 
tion characteristics,  is  given  in  Table  ö.^^-^^-^^  The  table  includes,  for 
comparison,  the  results  of  experiments  with  monovalent  anions,  which  do 
not  show  such  an  effect  except  at  very  high  ionic  strengths.  (It  should  be 
stated  that  the  most  desirable  test,  namely  an  estimate  of  the  refolding 
capacity  of  ribonucleic  acid  itself,  is  not  technically  feasible  because  of  the 
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high  extinction  coefficient  of  this  material.)  These  findings  strongly  suggest 
that  the  substrate,  ribonucleic  acid,  is  capable  of  refolding  the  protein  in 
spite  of  the  presence  of  8m  urea  and  that  the  structural  configuration  respon- 
sible for  catalytic  activity  is  unchanged  by  this  agent  under  the  conditions 
of  assay. 

Table  6 

SOME    PHYSICAL    PROPERTIES    OF    RIBONUCLEASE    AND 
MODIFIED    RIBONUCLEASE 


Ribonuclease 

Unfold- 
ing agent 

Anion 

Je  X  10-3 
(at  285 

D 

Ac* 

7?Sp/'= 

Native 





1120 

-71-7° 

230-7 

0036 

Native 

8  M  urea 

— 

0 

-103-7° 

220 

0-085 

Native 

8  M  urea 

0-50  M  acetate 

0 

— 

— 

— 

Native 

8  M  urea 

0-40  M  chloride 

110 

-98-8° 

— • 

— 

Native 

8  M  urea 

0  05  M  sulfate 

— 

-102-5° 

— 

— 

Native 

8  M  urea 

0-25  M  sulfate 

840 

-88-0° 

— 

— 

Native 

8  M  urea 

0  001  M  phosphate 

112 

-103-7° 

— 

— 

Native 

8  M  urea 

0  003  M  phosphate 

390 

-102-9° 

— 

— 

Native 

8  M  urea 

0-01  M  phosphate 

900 

-99-9° 

— 

0-070 

Native 

8  M  urea 

0  02  M  phosphate 

— 

— 

— 

0-057 

Native 

8  M  urea 

0-15  M  phosphate 

1120 

-81  0° 

— 

0050 

Native 

8  M  urea 

0-15  M  arsenate 

1120 

-80-7° 

— 

— 

Native 

8  M  urea 

0-15  M  pyrophos- 
phate 

1120 

— 

— 

— 

Native 

8  M  urea 

0-003  M  polymeta- 
phosphate 

447 

— 

~ 

Native 

8  M  urea 

0-055  M  polymeta- 
phosphate 

1120 

-82-8° 

~ 

Native 

8  M  urea 

0-15  M  uridylate 

— 

— 

— 

0-052 

Native 

8  M  urea 

0-15  M  citrate 

0 

— 

— 

— 

Native 

6M 

guani- 

dine  HCl 

0 

-108-1° 

216 

0-097 

Native 

6M 

guani- 

dine  HCl 

0-30  M  phosphate 

0 

-102-3° 

217-5 

0104 

Oxidized 

— 

— 

0 

-91-6° 

226 

0-116 

Reduced  and 

— 

— 

0 

— 

— 

0-149 

carboxymethy- 

lated 

Pepsin  inacti- 

— 

— 

0 

-83-8° 

— 

0-039 

vated 

Subtilisin     di- 

— 

— 

1120 

— 

— 

— 

gested 

Methyl  ester 

~ 

0 

-84-0° 

The  concentration  of  the  proteins  was  2-5-2-7  g./l.  for  spectrophotometric  readings, 
25-30  g./l.  for  Polarimetrie,  and  14-15  g./l.  for  viscometric  measurements.  All  measure- 
ments were  carried  out  at  pH  6-7. 

*  See  Yang  and  Doty  (1957). 
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Other  data  in  Table  6  indicate,  however,  that  refolding  by  anions  is  not 
quantitatively  complete.  Thus  measurements  of  the  optical  rotatory  properties 
and  the  intrinsic  viscosity  of  ribonuclease  in  8m  urea  solutions  contain- 
ing orthophosphate  or  polymetaphosphate  ions  give  values  that  are  com- 
patible with  a  still  partially  unfolded  form  of  the  protein.  The  combined 
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Fig.  8.  Decreases  in  extinction,  compared  with  native  ribonuclease  in  water,  of  (A), 
inactive  derivative  produced  by  limited  pepsin  digestion,  (B),  ribonuclease  in  8m  urea, 
and  (C),  ribonuclease  in  8m  urea  containing  0  003m  orthophosphate  ions.  Curves  calcu- 
lated from  individual  curves  obtained  on  Cary  ultraviolet  recording  spectrophotometer.  2» 

data  suggest  that  that  portion  of  the  molecule  which  is  responsible  for  the 
modified  light  absorption  of  native  ribonuclease  is  also  concerned  with 
catalytic  activity  and  that  catalysis  is  unimpaired  in  spite  of  minor  structural 
alterations  elsewhere  in  the  fabric  of  the  protein. 

(b)  Pepsin  digestion.  This  conclusion  is  further  supported,  in  a  converse  sort 
of  way,  by  the  spectrophotometric  and  optical  rotatory  results  obtained  on 
the  derivative  produced  by  limited  pepsin  digestion. ^^-^^  This  derivative, 
which,  as  was  discussed  above,  differs  from  the  native  enzyme  only  by 
lacking  the  four  C-terminal  residues,  shows  a  shifted  spectrum  (curve  A, 
Fig.  8)  similar  to  that  seen  in  8m  urea.  Orthophosphate  is  without  effect  on 
this  pepsin-induced  shift.  The  specific  optical  rotation  of  the  derivative  is 
similar  to  that  obtained  with  native  ribonuclease  or  with  ribonuclease  in 
8m  urea  solutions  containing  polyvalent  anions.  It  may  be  tentatively  con- 
cluded, therefore,  that  the  secondary  structure  of  the  derivative  is  reasonably 
'native',  a  conclusion  also  consistent  with  earlier  estimates  of  the  intrinsic 
viscosity  and  sedimentation  behavior  of  this  material. ^^ 
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(c)  Heat,  alkali,  and  methylation.  Ribonuclease  may  be  progressively  inacti- 
vated, in  an  irreversible  way,  by  exposure  to  heat,  to  alkalinity  above  pH 
12'3-12*7,^*  and  by  esterification  of  free  carboxyl  groups.  These  treatments 
all  lead  to  a  shift  in  spectrum,  the  extent  of  the  shift  being  proportional  to 
the  extent  of  inactivation.  The  titration  studies  of  Tanford  and  Hauenstein^' 
indicate  that  the  irreversible  change  in  spectrum  occurring  at  high  pH  in- 
volves the  modification  of  approximately  3  hydrogen-bonded  tyrosine  resi- 
dues, as  suggested  also  by  the  studies  of  Harrington  and  Schellman^'  on 
the  urea  denaturation  of  ribonuclease.  In  the  latter  case,  however,  the  shift 
in  spectrum  is  reversible  and  is  prevented  by  anions.^* 

{d)  Acid,  subtilisin  digestion,  oxidation,  and  reduction.  Scheraga^^  has  shown 
that  a  spectral  shift  takes  place  when  ribonuclease  is  dissolved  in  acid  solu- 
tions and  that  the  extent  of  the  shift  is  pH  dependent  in  a  manner  consistent 
with  the  titration  of  acid  groups  having  pK  values  in  the  neighborhood  of 
pH  2.  These  unusually  acid  groups  may  represent  the  electronegative  groups 
involved  in  hydrogen  bond  formation  with  the  tyrosine  hydroxyl  groups 
mentioned  above, 

Performic  acid  oxidation,  which  causes  complete  inactivation, ^^  produces 
an  irreversible  shift  in  spectrum. ^^  Partial  reduction,  however,  when  carried 
to  a  point  where  approximately  one  disulfide  bridge  per  molecule  has  been 
cleaved,  and  where  no  trace  of  remaining  native  enzyme  can  be  detected 
electrophoretically,  yields  a  product  with  essentially  full  activity  and  which 
shows  a  'native'  spectrum.  ^ 

Subtilisin  digestion  causes  extensive  and  random  digestion  of  the  native 
enzyme  and  during  this  digestion  there  occur  parallel  changes  in  enzyme 
activity  and  in  the  degree  to  which  the  spectrum  has  been  shifted  to  lower 
wavelengths. 

(e)  Guanidine  hydrochloride ^^  Guanidine  hydrochloride  is  known  to  be  a 
much  stronger  denaturing  agent  than  urea  (see,  for  example,^").  It  was 
found  that  6m  solutions  of  this  salt  completely  inactivated  ribonuclease  in 
contradistinction  to  the  results  with  urea.  The  studies  with  guanidine  indi- 
cated that  the  substrate  molecule  was  unable  to  bring  about  refolding  of 
the  polypeptide  chain.  Spectrophotometric,  optical  rotatory,  and  viscometric 
measurements  bear  out  this  conclusion  (see  Table  6).  Optical  dispersion 
measurements  (Table  6  and  Fig.  9)  also  indicate  that  guanidinium  ions  cause 
an  even  greater  disorientation  of  the  secondary  structure  of  ribonuclease 
than  does  urea,  as  shown  by  a  comparison  of  the  rotatory  dispersion  con- 
stants for  the  protein  in  these  two  systems.  In  view  of  these  results  it  was 
of  interest  to  determine  the  level  of  guanidine  hydrochloride  at  which  ribo- 
nuclease is  active.  Since  the  activity  of  ribonuclease  is  strongly  affected  by 
the  ionic  strength  of  the  assay  solution, ^^  and  since  guanidine  hydrochloride 
is  a  strong  salt,  it  was  first  necessary  to  determine  whether  or  not  this  agent 
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had  an  inactivating  effect  over  and  above  that  shown  by  equivalent  levels 
of  sodium  chloride.  The  data  in  Fig.  10  show  that  guanidinium  ions  exert 
a  specific  inactivating  effect,  not  attributable  to  ionic  strength,  in  the  con- 
centration range  between  1  and  3  molar.  It  is  precisely  in  this  range  of 
concentrations  that  low  levels  of  orthophosphate  no  longer  prevent  the  shift 


20 

1                            1                ■ 

1 

1 

30 

B__,_-- 

^^_^^ 

40 

--*-'^            1                          1 

1 

1 

-250  -200  -150  -100  -50 

[q]x 

Fig.  9.  The  optical  rotatory  dispersion  of  ribonuclease  in  :  (A),  water,  (B),  6m  guanidine 
hydrochloride  containing  0-3m  phosphate  buffer,  pH  6-5,  (C),  6m  guanidine  hydrochloride's 
The  data  are  plotted  according  to  the  suggestion  of  Yang  and  Doty.^* 


MOLARITY    OF    SALT 

Fig.  10.  Activity  of  ribonuclease  in  sodium  chloride  solutions  (solid  circles),  guanidine 
hydrochloride  solutions  (open  circles).  Activity  measured  by  alkali  uptake  in  the  Jacobsen- 
Lconis^^  autotitrator  at  pH  6  0  and  plotted  as  the  velocity  constant,  k,  of  the  first  order 
reaction,  in  minutes"^.'^ 

in  spectrum  characteristic  of  the  inactive  enzyme.  We  may  conclude,  there- 
fore, that  the  inactivation  of  ribonuclease  by  guanidinium  ions  is  also  closely 
correlated  with  a  change  in  the  ultraviolet  absorption  properties  of  certain 
tyrosine  residues.  The  guanidine  effect  is  completely  reversible  upon  dilu- 
tion of  the  system  or  following  removal  of  this  material  by  dialysis. 
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CONCLUSIONS 

It  may  be  stated  with  reasonable  certainty  that  the  polypeptide  chain  of 
ribonuclease,  free  of  cross  linkages,  is  not  sufficient  for  catalytic  activity 
since  the  oxidized  and  reduced  forms  of  the  protein  are  completely  inert. 
This  conclusion  is  supported  by  the  results  of  studies  which  indicate  that 
the  methods  employed  for  the  cleavage  of  disulfide  bridges  do  not  chemi- 
cally modify  amino  acids  other  than  cystine.  (One  must,  however,  keep  in 
mind  the  possibiUty  that  the  open  chain  does  possess  intrinsic  enzyme  acti- 
vity but  that  the  formation  of  the  proper  catalytic  configuration  is  prevented 
by  the  presence  of  sulfonic  acid,  carboxymethyl,  or  sulfhydryl  groups.) 

The  weight  of  evidence  now  suggests  that  there  exists  in  the  native  mole- 
cule a  constellation  of  amino  acid  residues,  considerably  less  in  size  than 
the  intact  protein,  which  is  responsible  for  catalytic  activity.  The  results 
of  Richards^^  on  the  limited  degradation  of  the  chain  with  subtilisin,  make 
it  likely  that  the  A^-terminal  portion  of  the  molecule  is  not  involved.  Simi- 
larly, the  last  few  residues  at  the  C-terminal  end  of  the  chain  appear  to  be 
superfluous. 21*  Partial  reduction  of  disulfide  bridges^  also  seems  to  be  per- 
missible. 

The  physical  studies  discussed  above  have  indicated  a  correlation  between 
activity  and  the  presence  of  a  unique  type  of  spectrum,  characteristic  of 
tyrosine  residues  whose  hydroxyl  groups  seem  to  be  linked  through  hydro- 
gen bonding  to  carboxylate  groups  elsewhere  in  the  molecule.  It  cannot, 
of  course,  be  stated  that  such  residues  are  necessarily  implicated  in  the 
actual  catalytic  process.  However,  it  does  seem  likely  that  one  or  more 
hydrogen-bonded  tyrosine  residues  must  be  intimately  concerned  with  the 
'active  center'  of  ribonuclease  and  that  these  exist  in  such  a  linkage  when 
the  geometry  of  the  'center'  is  in  its  proper  configuration. 

These  chemical  and  physical  observations,  when  considered  together  with 
the  fact  that  limited  pepsin  digestion  causes  complete  inactivation  after  the 
removal  of  only  a  small  C-terminal  fragment,^^  lead  us  to  conclude  that 
at  least  a  part  of  the  catalytically  active  portion  of  the  ribonuclease  chain 
resides  near  the  C-terminus.  The  apparent  requirement  for  at  least  two 
intact  disulfide  bridges  suggests  that  such  an  area  of  the  molecule  is  joined 
in  close  proximity  to  one  or  more  other  essential  amino  acid  sequences. 
The  size  and  location  of  these  can  obviously  not  be  considered  until  a  great 
deal  of  further  information  is  available  on  the  relative  importance  of  the 
individual  disulfide  bridges  and  on  the  extent  to  which  the  ribonuclease 
molecule  can  be  further  degraded  without  complete  loss  of  function. 

*  The  word  'superfluous'  must  be  used  in  a  very  qualified  sense,  since  it  appears  rather 
unlikely  that  'superfluous'  structures  would  have  survived  the  attrition  of  mutation  and 
selection  that  has  been  at  work  since  Nature  first  devised  ribonuclease  molecules  if  some 
selective  advantage  were  not  involved. 
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IMMUNOLOGICAL   CRITERIA   OF   HOMOGENEITY 

The  immunological  analysis  of  a  protein  provides  one  of  the  most  sensitive 
methods  for  the  detection  of  macromolecular  impurities.  Diffusion  and  pre- 
cipitation of  antigen  and  antibody  in  agar  (Oudin,  1952)  leads  to  discrete 
precipitin  zones  for  each  antigen,  the  total  number  of  independent  zones 
representing  a  minimum  estimate  of  the  number  of  antigens  present. 

Since  individual  animals  vary  in  their  responsiveness  to  the  same  antigen 
(Sobey,  1954)  and  since  a  massive  antibody  response  occurs  more  readily 
to  some  antigens  than  to  others,  it  is  necessary  to  examine  diffusion  and 
precipitation  in  agar  with  antisera  from  several  individuals  and  with  the 
antisera  from  the  same  animals  after  successive  courses  of  immunization. 

Molecules  which  are  indistinguishable  as  antigens  (Heidelberger,  1954) 
but  are  of  different  mobility  can  be  distinguished  by  electrophoresis  in  agar 
and  by  their  subsequent  reaction  with  antibody  diffusing  from  channels 
placed  parallel  to  the  direction  of  electrophoretic  migration  (Williams  and 
Grabar,  1953;  see  also  Cinader  and  Dubert,  1955). 

The  assay  of  the  nitrogen  contents  of  precipitates  of  antigen  and  antibody 
gives  not  only  information  on  the  combining  ratio  of  reactants,  but  also 
on  the  relative  purity  of  different  preparations  of  the  same  protein.  The 
shape  of  a  plot  of  total  N  precipitated  as  a  function  of  antigen  N  added 
to  a  constant  volume  of  immune  serum  (Cohn,  1952)  and  the  quantity  of 
antigen  leading  to  maximum  precipitation  of  antibody  (Cinader  and  Pearce, 
1956)  can  serve  as  a  sensitive  index  of  relative  purity. 

The  supernatants  from  antigen-antibody  precipitates  may  provide  addi- 
tional information  on  the  heterogeneity  of  the  antigen.  The  simultaneous 
presence  of  antigen  and  antibody  in  the  same  supernatant  has  usually  been 
accepted  as  indicating  the  presence  of  antigenic  contaminants.  However, 
this  is  not  necessarily  so  ;  only  the  examination  of  supernatants  by  diffusion 
and  precipitation  in  agar  can  show  whether  the  presence  of  antigen  and 
antibody  in  the  same  supernatants  is  due  to  soluble  complexes  of  the  prin- 
cipal antigen  and  its  antibody  or  to  antigenic  contaminants. 

*  Present  address  :  Microbiological  Research  Establishment,  Porton,  Wiltshire. 
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We  can  also  distinguish  between  preparations  of  different  purity  by  attach- 
ing the  soluble  antigen  to  erythrocytes  by  means  of  tannic  acid  (Boyden, 
1951).  Antisera  to  the  attached  antigen  agglutinate  such  sensitized  erythro- 
cytes. The  agglutination  can  be  inhibited  by  the  addition  of  the  soluble 
antigen  to  the  antiserum  (Cinader  and  Dubert,  1955,  1956;  Cinader  and 
Pearce,  1956).  The  amount  of  protein  required  for  this  inhibition  depends 
on  the  protein  impurities  present  in  a  given  preparation  of  antigen,  that 
is  on  the  percentage  of  total  protein  effective  as  inhibitor  (Cinader  and 
Pearce,  1956). 

When  the  antigen  is  an  enzyme,  the  precipitation  of  antigen  should  run 
parallel  with  the  disappearance  of  enzyme  activity  from  the  supernatants; 
when  antigen  can  no  longer  be  demonstrated  in  the  supernatant,  enzyme 
activity  should  also  be  minimal.  A  different  sequence  of  events  indicates 
the  presence  of  protein-impurities  without  enzyme  activity  (Cohn,  1954). 
None  of  the  approaches  can  be  safely  interpreted,  if  used  alone;  each  of 
them  should  contribute  its  information  to  the  final  correlation  of  evidence 
relating  to  homogeneity. 

The  following  immunochemical  experiments  with  ribonuclease  are  con- 
cerned with  the  homogeneity  of  the  antigen,  but  it  also  forms  the  continua- 
tion of  studies  of  antibody  as  an  inhibitor  of  enzyme  (Cinader,  1955,  1957), 

THE    INTERACTION    OF    RIBONUCLEASE 
WITH    ANTIBODY 

Crystallized  bovine  ribonuclease  consists  of  a  major  component,  the  enzyme, 
and  of  a  minor  component  which  is  antigenically  distinct  from  the  enzyme. 
The  presence  of  these  two  components  can  be  demonstrated  by  the  reaction 
in  agar  between  antibody  to  crystallized  ribonuclease  and  chromatographi- 
cally  separated  fractions  of  the  crystallized  protein  (Fig.  1).  The  arrange- 
ment in  the  agar  plates  of  reactants  and  the  zone  of  precipitation  formed 
between  them  is  shown  in  Fig.  2.  A  common  zone  corresponding  to  the 
enzyme  runs  between  all  the  cups  containing  antigen  and  the  cups  contain- 
ing the  antibody.  One  chromatographic  fraction,  that  first  eluted  from  the 
chromatographic  column,  contains  a  second  antigen  which  reacts  inde- 
pendently of  the  zone  common  to  all  the  fractions  and  which  runs  across  the 
main  zone  formed  between  ribonuclease  A  and  B  and  the  antiserum. 

Electrophoresis  in  agar  of  crystallized  ribonuclease  followed  by  diffusion 
of  antibody  from  lateral  channels  showed  the  principal  antigen  to  be  in- 
homogeneous  (Cinader,  Rondle  and  Pearce,  1955)  and  to  consist  of  a  fast 
and  a  slow  moving  component;  these  two  components  of  the  enzyme  are 
antigenically  indistinguishable  since  antibody  reacts  with  them  in  agar  with 
the  formation  of  a  continuous  zone  with  two  maxima.  The  minor  antigen 
component  present  in  the  fraction  first  eluted  from  a  chromatographic  column 
migrates  with  the  same  velocity  as  the  slower  of  the  two  electrophoretic 
components  of  the  enzyme. 
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Fig.  1.  Chromatographic  fractionation  of  bovine  ribonuclease  (Armour,  381-59)  by  the 
method  of  Hirs,  Moore  and  Stein  (1953).  Numbers  refer  to  the  contents  of  individual 
tubes  used  in  the  experiments  shown  in  Fig.  2.  'A'  indicates  the  tubes  pooled  for  the 
determinations  in  Tables  1  and  2. 


Fig.  2.  Drawing  of  precipitin  zones  apparent  in  agar  after  3  days  of  development.  Circles 
denote  the  position  of  cups  in  the  agar.  S . . . .  indicates  the  position  of  cups  filled  with 
ribonuclease  antiserum,  t shows  cups  containing  chromatographic  fractions  of  ribonu- 
clease (Armour,  381-59);  numbers  refer  to  tube  numbers  (Fig.  1). 
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With  some  preparations  of  crystallized  ribonuclease  a  third  electrophoretic 
component  is  found  to  move  at  a  greater  velocity  than  the  faster  of  the  two 
components  already  mentioned  and  with  one  preparation  of  ribonuclease 
a  fourth  component  of  very  low  mobility  was  encountered.  These  two  com- 
ponents are  antigenically  identical  with  the  two  principal  electrophoretic 
components  of  the  enzyme. 

Ribonuclease  A  was  found  to  consist  of  one  antigenic  species  which  mig- 
rated in  agar  with  the  same  two  mobilities  as  the  two  components  usually 
observed  in  the  electrophoresis  in  agar  of  the  crystallized  enzyme.  The 
possibility  that  these  two  electrophoretic  components  were  artifacts  of  electro- 
phoresis could  be  excluded.  Immediately  after  the  electrophoresis  was  com- 
pleted, portions  of  the  agar  were  cut  out  in  positions  corresponding  to  the 
predetermined  areas  of  highest  antigen  concentration;  these  were  inserted 
into  fresh  agar  slides  and  subjected  to  electrophoresis.  At  the  end  of  this 
second  run  antiserum  was  allowed  to  diffuse  through  the  agar  from  a  median 
channel  ;  each  component  of  the  enzyme  was  found  to  react  with  the  forma- 
tion of  a  zone  showing  one  maximum  only  and  hence  to  react  as  a  single 
electrophoretic  component. 

It  is  unlikely  that  a  second  electrophoretic  component  was  present  through 
contamination  from  the  neighbouring  chromatographic  component  of  ribo- 
nuclease B  :  we  have  observed  two  components  not  only  with  a  pool  of  the 
contents  of  tubes  of  effluent  corresponding  to  ribonuclease  A,  but  also  with 
the  contents  of  the  tube  corresponding  to  the  maximum  of  the  chromato- 
graphic peak  of  ribonuclease  A  (Fig.  1).  We  cannot  exclude  the  possibility 
that  a  second  electrophoretic  component  may  have  been  formed  after  ribo- 
nuclease A  had  been  separated  chromatographically. 

Supernatants  from  precipitates  formed  by  varying  quantities  of  ribonu- 
clease A  and  constant  quantities  of  antiserum  were  examined  by  precipitin 
test  and  by  diffusion  and  precipitation  in  agar.  In  no  case  could  the  co- 
existence of  antigen  and  antibody  be  shown  by  precipitin  tests,  or  the  pre- 
sence of  an  antigen  impurity  in  the  equivalence  zone  by  diffusion  in  agar. 
However,  the  co-existence  of  enzyme  and  its  antibody  could  be  shown  in 
supernatants  from  precipitates  of  ribonuclease  A  and  several  antisera,  when 
the  supernatants  were  examined  for  their  power  to  agglutinate  tanned  and 
sensitized  erythrocytes  and  for  their  enzyme  activity.  Thus  antibodies  were 
encountered  which  gave  rise  to  soluble  complexes  of  enzyme  and  antibody 
in  a  region  which  would  be  identified  as  the  equivalence  zone  by  precipitin 
tests  alone  (Cinader  and  Pearce,  1956).  Some  antibodies  formed  in  response 
to  repeated  intravenous  injections  with  bovine  ribonuclease  gave  rise  to 
nearly  50%  of  soluble  complexes  of  antigen  and  antibody.  It  is  clear  that 
true  combining  ratios  of  antigen  and  antibody  cannot  be  directly  determined 
with  sera  of  this  nature.  We  therefore  selected  for  the  measurement  of  com- 
bining ratios  an  immune  serum  which  showed  an  equivalence  zone  by  all  the 
criteria  employed.  This  serum  was  obtained  from  a  rabbit  by  immunization 
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with  Freund's  adjuvant  followed  by  several  courses  of  intravenous  injections. 
The  combining  ratios  of  antibody  and  antigen  were  obtained  directly  in 
excess  antibody  (Table  1)  and  indirectly  in  excess  antigen  (Table  2),  using 
a  procedure  described  by  Heidelberger  and  Kendall  (1935);  see  also  Kabat 
and  Mayer  (1948). 

ANTIBODY   AS    ENZYME-INHIBITOR 
(see  also  Cinader,  1957) 

The  enzyme  action  of  ribonuclease  is  inhibited  by  antibody.  This  is  best 
shown  by  adding  increasing  amounts  of  the  antibody  to  a  constant  amount 
of  enzyme  (Cinader,  1953,  1955).  A  plot  of  enzyme  activity,  as  a  function 
of  the  quantity  of  antibody  added,  shows  initially  a  linear  decrease  and 
finally  levels  out  in  excess  antibody  at  a  fairly  constant  residual  activity. 
This  final  constant  level  of  activity  is  not  the  same  for  every  serum  and 
varies  during  the  course  of  immunization.  Initially  only  60-70%  of  the  en- 
zyme activity  may  be  inhibited  by  excess  antibody,  but  antibody  formed 
after  repeated  courses  of  immunization  inhibits  95-98%  of  the  activity  of 
the  enzyme.  The  changes  in  inhibitory  power  of  a  series  of  bleedings  from 
a  rabbit  are  shown  in  Fig.  3.  The  rabbit  was  first  injected  subcutaneously 
with  Freund's  adjuvant  and  received  later  successive  courses  of  intravenous 
injections.  The  marked  change  of  inhibitory  power  is  not  due  to  a  decrease 
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Fig.  3.  Changes  in  inhibiting  potency  of  antibody  produced  in  response  to  repeated 
courses  of  immunization. 

The  effect  of  the  addition  of  increasing  volumes  of  immune  serum  to  constant  quantities 
of  bovine  ribonuclease. 

Ribonuclease  antisera  from  rabbit  619 
Symbol  Date  of  bleeding 

— O—  10.7.54 

— X —  2.8.54 

— i—  22.9.54 

-/\-  1.6.55 
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Table  1 

PRECIPITATION    OF    RIBONUCLEASE    BY    RABBIT    ANTIBODY 

Addition  of  increasing  amounts  of  ribonuclease  to  005  ml.  of  serum  619 
Rn  =  Ribonuclease 
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Antigen 

N 

(Mg-) 

Total  N 
preci- 
pitated 
(/ig-) 

Ratio 
antibody 

N  to 
antigen 
Nin 
preci- 
pitate 

Molecular  ratio* 
(antibody  to 
antigen)  in: 

Tests  on  supernatant 

Antigen 

Antibody 

Preci- 
pitate 

Super- 
natant 

Floccu- 
lation 

Enzyme 
activity 
as  ixg. 
N  RnA 
per  3  ml. 

Floccu- 
lation 

Aggluti- 
nation 
titre 

1-0 

2-0 

3  05 

3-95 

4-9 

5-9 

7-4 

8-9 

9-8 

10-2 

13-2 

15-3 

17-3 

19-3 

34-4 

66-8 

101  0 

118-9 

135-2 

148-2 

177-6 

185-7 

191-4 

188-4 

138-5 

106-4 

83-5 

69-2 

33-4 
32-6 
321 
291 
26-4 
24  0 
22-9 
19-9 
18-5 
17-7 
15-9 
16-7 
17-8 
17-0 

2-86 
2-79 
2-74 
2-49 
2-26 
2  05 
1-96 
1-70 
1-58 
1-51 
1-36 
1-43 
1-52 
1-45 

0-87 
0-75 
0-71 
0-65 

+ 
+ 
+ 
+ 

+ 

0-5 

3-8 

7-2 

10-2 

13-6 

+ 
+ 
+ 
+ 

+ 

200 

200 

200 

50 

50 

10 

1 

<1 

<1 

<1 

<1 

<1 

<1 

*  Calculated   assuming   identical   N    content   for  ribonuclease  and  antibody,  a  molecular 
weight  13,680  for  ribonuclease  (Hirs,  Moore  and  Stein  1956)  and  of  160,000  for  rabbit  antibody. 


Table  2 

MOLECULAR    RATIO    (RABBIT    ANTIBODY/RIBONUCLEASE    A)    IN 
REGION    OF    ANTIGEN    EXCESS;    SERUM    619 

Rn= Ribonuclease  A 


RnN 

added 
to016 
ml.  of 
serum 
(/ig.) 

Total 
Nin 
ppt. 
(/ig.) 

Specific 
Nin 
super- 
natant 
(/ig.) 

Vol. 

anti- 
serum 
added 

to 
super- 
natant 

(ml.) 

Total  N 
in  ppt.* 

from 
analysis 

of 
super- 
natant 
(/ig-) 

Antibody 
N  preci- 
pitatedf 
from 
serum 
added  to 
super- 
natant 
(/ig.) 

Per 

cent 
RnN 
in  2nd 

ppt. 

RnN 

in 
entire 
super- 
natant 
(/ig.) 

RnN 
in 

ppt. 
(/ig.) 

Molecular 

ratio  of 

antibody 

to  antigen 

in  ppt. 

32-6 
42-4 
48-9 
55-4 
61-9 

602-7 
443  1 
340-5 
267-2 
221-5 

110 
180-4 
289-5 
369-3 
421-5 

0028 

012 

0188 

0-228 

0-252 

9-0 
350-2 
594-6 
796-6 
936-7 

1-7 
229-9 
401-5 
550-4 
655-7 

2-2 

3-1 

3-4 

3-45 

3-5 

0-4 
19-3 
30-65 
41-2 
49-6 

32-3 
26-3 
18-3 
14-2 
12-3 

1-51 
1-36 
1-43 
1-52 
1-45 

*  All  values  determined  on  2/3  of  supernatant. 
t  From  serum  added  to  supernatant. 
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in  soluble  antigen-antibody  complex  but  to  a  decrease  in  the  enzyme  activity 
of  the  precipitates  of  antigen  and  antibody. 

So  far  little  is  known  from  direct  experiments  about  the  specificity  of 
antibodies  to  native  proteins.  Experiments  by  Landsteiner  and  van  der 
Scheer  (1934)  and  of  Landsteiner  (1945)  with  peptide  azo  proteins  have 
indicated  that  the  specificity  of  the  antibody  may  correspond  to  at  least 
5  amino  acids,  but  this  is  a  minimum  estimate  in  a  system  which  may  not 
represent  an  adequate  model  for  a  native  protein.  The  work  of  Porter  pre- 
sented in  this  symposium  (p.  290)  and  observations  recently  published  by 
Lapresle  (1955)  thus  constitute  a  valuable  advance  of  this  problem. 

The  studies  of  Moore  and  of  his  colleagues  are  rapidly  unravelling  the 
detailed  structure  of  ribonuclease,  thus  presenting  us  with  known  polypep- 
tides which  might  be  used  to  inhibit  the  reaction  between  ribonuclease  and 
its  antibody.  The  enzyme  nature  of  ribonuclease  would  permit  studies  in 
which  antibody  action  could  not  only  be  inhibited  in  terms  of  precipitation 
between  enzyme  and  antibody,  but  also  in  terms  of  the  effect  of  antibody 
on  enzyme  action. 

Thus  the  collaboration  of  enzyme-chemist  and  immuno-chemist  may  not 
only  advance  our  knowledge  of  antibody  specificity  but  may  even  result  in 
some  information  on  the  nature  of  the  catalytic  site  of  enzymes. 
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Thanks  to  electron  microscopy  the  shape  of  the  tobacco  mosaic  virus  rod 
is  now  so  well  known,  as  to  require  hardly  any  recapitulation.  The  300  m/x 
long  particle  consists  to  approximately  94%  of  protein  and  to  6%  of  nucleic 
acid.  The  protein  is  generally  regarded  as  forming  the  outer  shell  around  the 
nucleic  acid  cylinder,  although  recent  X-ray  data  indicate  that  the  nucleic 
acid  is  deeply  embedded  in  the  protein,  and  may  actually  approach  the 
periphery  where  the  surface  is  deeply  grooved.  Within  the  past  two  years 
it  has  clearly  been  demonstrated  that  the  nucleic  acid  alone  can  initiate 
infection  and  that  it  carries  all  the  necessary  'information'  for  the  produc- 
tion of  complete  progeny  virus. ^'^  The  efforts  of  virologists  are,  therefore, 
now  centering  on  this  crucial  minor  constituent  of  the  virus.  However,  the 
role  played  by  the  protein  moiety  in  various  important  regards  must  not  be 
overlooked.  In  quantitative  respects,  the  protein  contributes  to  viral  infec- 
tivity,  since  it  potentiates  the  action  of  the  nucleic  acid  20-50  fold,  possibly 
by  giving  it  protection  and  rigidity.*-^  The  protein  also  accounts  for  the 
serological  activity  of  viruses  and  thus  gives  us  a  tool  with  which  to  combat 
them.  And  finally,  the  protein  probably  influences  host  range  specificities  and 
virulence.  It  is  of  particular  interest  that  the  protein  components  have  been 
shown  to  differ  in  specific  and  characteristic  manner  from  one  virus  strain 
to  another.^''  The  protein  may,  therefore,  be  regarded  as  a  phenotypic 
expression  of  the  chemically  still  obscure  genotypic  characteristics  of  viruses. 
This  symposium  has  clearly  documented  our  advances  towards  the  goal  of 
complete  structural  analysis  of  proteins.  The  possibility  exists,  therefore, 
that  the  structure  of  the  virus  protein  may  be  elucidated  within  the  next 
few  years.  The  virus  may  then  become  the  Rosetta  stone  of  biochemical 
genetics,  in  supplying  the  bilingual  record  needed  to  decipher  nucleic  acid 
in  terms  of  protein  structure. 

While  various  estimates  of  the  virus  particle  weight  range  from  40  to 
50  million,  it  has  been  recognized  for  several  years  that  each  rod  represents 
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an  aggregate  of  much  smaller  protein  and  nucleic  acid  subunits  or  mole- 
cules. The  minimal  molecular  weight  of  the  protein  subunit,  based  on  amino 
acid  analyses,  is  about  18,000.  One  cysteine  and  two  lysine  residues  occur 
in  all  strains  of  TMV,  and  one  histidine  in  the  H.R.  strain,  in  a  unit  of  about 
18,000  gms.^'^''  That  this  approximates  the  true  molecular  weight  of  the 
smallest  unit  was  strongly  suggested  by  the  discovery  of  Harris  and  Knight 
that  carboxypeptidase  releases  only  threonine  and  no  other  amino  acid  from 
the  virus  and  that  the  rapidly  reached  maximum  amount  of  threonine  cor- 
responds to  one  residue  per  17,300^-^  or  about  2900  residues  per  virus 
particle.  It  has  since  been  established  beyond  doubt  that  this  threonine 
residue  represents  a  C-terminal  group,^"-^^  and  that  the  virus  is  actually 
composed  of  about  2900  separate  peptide  chains,  held  together  only  by 
secondary  forces.  Since  'macro-structure'  is  the  main  function  of  this  pro- 
tein, it  is  not  surprising  that  those  secondary  binding  affinities  are  much 
in  evidence,  so  that  the  subunits  are  hard  to  separate,  and  even  harder  to 
keep  separated  under  conditions  favorable  for  physicochemical  characteriza- 
tion. Thus,  only  rough  approximations  of  the  subunit  weight  have  as  yet 
been  made  on  the  basis  of  sedimentation  studies,  ^^  but  these  support  the 
concept  that  single  peptide  chains  supply  the  bricks  building  up  the  super- 
molecule. 

DEGRADATION    OF   VIRUS    AND    ISOLATION 
OF    PROTEIN 

A  variety  of  methods  has  been  used  for  the  degradation  of  the  intact  virus 
and  for  the  isolation  of  disaggregated  nucleic  acid-free  protein.  Detergents, 
such  as  sodium  dodecyl  sulfate  (SDS),  represent  the  most  efficient  agents 
for  the  disruption  of  all  interunit  bonds,  but  the  protein  obtained  from  such 
reaction  mixtures  shows  many  undesirable  features,  some  of  which  will  be 
referred  to  later.  It  appears  that  these  agents  break  not  only  inter-chain 
but  also  intra-chain  secondary  bonds.  Thus  the  product  is  not  the  expected 
mixture  of  sticky  bricks,  i.e.,  subunits  of  definite  shape,  but  a  mixture  of 
sticky  threads,  i.e.,  denatured  peptide  chains,  with  a  tendency  to  unlimited 
random  aggregation  into  a  most  intractable  and  insoluble  material. 

Gentler  methods  for  the  isolation  of  the  protein  make  use  of  alkali  which 
was  found  by  Stanley  to  degrade  the  virus. ^^  A  commonly  used  procedure 
was  proposed  by  Schramm  et  al.^^  in  which  the  virus  is  solubilized  at  about 
pH  10-5  in  the  cold.  Carbonate,  glycine,  and  more  recently  alkanolamine 
buffers  have  been  used  for  this  purpose. ^^  Harrington  and  Schachman  have 
studied  the  kinetics  of  this  degradation,  and  have  demonstrated  the  great 
tendency  for  reaggregation  of  split  products  in  the  alkaline  reaction  mix- 
tures, particularly  at  higher  temperatures.^^  Ammonium  sulfate  can  con- 
veniently be  employed  to  separate  the  protein  cleanly  from  the  nucleic  acid 
after  all  types  of  degradation.  The  final  protein  preparation  obtained  from 
alkali-degraded  virus  appears  'native'  in  various  respects,  to  be  discussed 
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below,  and  gives  water-clear  solutions  except  in  the  isoelectric  region  (pH 
3-5-6-5).  In  that  pH  range  there  occurs  a  remarkable  reaction:  The  solution 
becomes  opalescent  and  its  'virus-like'  appearance  is  borne  out  by  electron 
micrographs  which  show  that  most  of  the  material  is  again  in  the  form  of 
rods  of  great  variability  in  length,  but  exactly  the  same  diameter  as  the 
original  virus.  The  shortest  rods  often  stand  upright  and  appear  as  circular 
disks  with  a  central  perforation.  These  have  been  referred  to  as  doughnuts. 
Thus,  the  prime  biological  activity  of  this  particular  protein,  namely  its 
tendency  to  controlled  aggregation  resulting  in  a  particular  macro-structure, 
has  not  been  destroyed  in  the  course  of  degradation  and  separation  of  the 
nucleic  acid.  The  sedimentation  coefficient  of  this  material  in  solution  (usually 
determined  at  about  pH  9)  resembles  that  of  Schramm's  electrophoretically 
separated  A-protein  (about  4  S)  and  suggests  a  molecular  weight  of  about 
100,000  corresponding  to  six-chain  aggregates.  It  appears  quite  possible  but 
by  no  means  proven  that  units  of  such  a  size  are  required  for  the  proper 
functioning  of  this  protein. 

A  convenient  method  for  the  breakdown  of  the  virus  and  isolation  of  the 
protein  has  recently  been  found  which  has  the  particular  advantage  of  great 
simplicity.^'  The  virus  is  degraded  by  means  of  cold  67%  acetic  acid,  the 
precipitate  which  flocculates  in  a  few  minutes  is  centrifuged  off  (largely 
nucleic  acid),  and  the  protein  which  alone  remains  in  the  water-clear  solu- 
tion is  isolated  after  dialysis.  The  resulting  protein  preparation  appears  indis- 
tinguishable in  all  respects  from  that  obtained  by  alkali  spHtting  and  am- 
monium sulfate  fractionation.  This  is  at  present  the  method  of  choice  in 
our  laboratory.* 

TERTIARY    STRUCTURE    OF    VIRUS    PROTEIN 

From  the  preceding  description  it  is  clear  that  the  protein  as  isolated  from 
dilute  alkali  or  from  acetic  acid  must  retain  a  definite  shape  which  enables 
it  to  bridge  the  gulf  between  a  100,000  molecular  weight  'Pseudoglobulin' 
and  a  virus  rod  particle  with  a  mol.  wt.  of  50  million.  What  type  of  bonds 
are  involved  in  maintaining  its  structure?  This  question  has  repeatedly  been 
attacked  in  our  laboratory.  One  amino  acid  side  chain  which  is  very  prob- 
ably implicated  in  this  structure  is  the  cysteine-SH  group.  This  group  was 
found  to  exist  in  a  form  which  protects  it  from  many  — SH  reagents  of 
various  types,  including  oxygen,  and  thus  renders  it  as  stable  as  the  masked 
— SH  groups  of  many  other  proteins. ^^^^  On  the  other  hand  the  — SH 
group  can  readily  be  titrated  after  unmasking  it  by  the  usual  denaturing 
agents  (such  as  guanidine  hydrobromide),  and  represents  a  convenient  means 
of  estimating  as  to  what  extent  a  protein  preparation  has  retained  its  original 
native  character.  It  was  found  to  be  quantitatively  present  (one  per  18,000 

*  It  has  only  recently  come  to  our  attention  that  acetic  acid  was  used  by  Bawden  and 
Pirie  in  1937  for  the  degradation  of  TMV,  although  not  as  a  means  of  preparing  the  pure 
virus  protein.18 
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mol.  wt.)  in  the  two  types  of  soluble  protein  described  above, ^'  but  absent, 
presumably  autoxidized,  in  the  detergent  type  of  preparation. 

Further  studies  of  this  group  have  shed  new  light  on  its  mode  of  linkage, 
and  probably  of  masked  protein  — SH  groups  generally.^^  For  it  has  be- 
come evident  that  the  H  atom  of  this  masked  group  can  be  replaced  by 
CHsHg —  or  by  an  iodine  atom  without  apparent  disturbance  of  protein 
structure.  The  substitution  by  iodine  is  particularly  surprising  since  sulfenyl- 
iodides  are  normally  quite  unstable  and  decompose  within  a  few  seconds 
in  aqueous  solvents.  In  the  native  virus,  in  contrast,  the  sulfenyliodide  group 
is  stable,  but  decomposes  rapidly  if  the  virus  is  denatured  by  agents  such  as 
guanidine  hydrobromide. 

It  must  be  concluded  from  these  results  that  the  masked  — SH  groups 
(1)  occur  as  such  in  the  protein  and  (2)  probably  represent  the  acceptors 
rather  than  donors  of  hydrogen-bonded  structures  involving  other  as  yet 

H 

/ 
unknown  groups  ( — S- — H X),  This  linkage  must  be  of  unusual  firm- 
ness in  the  present  case  since  the  masked  — SH  groups  of  ovalbumin  and 
a  few  other  proteins  that  were  tested  did  not  give  stable  sulfenyliodide  deriva- 
tives, although  their  reactivity  otherwise  resembled  that  of  TMV. 

Another  type  of  tertiary  bond,  which  is  at  present  receiving  much  atten- 
tion from  protein  chemists,  is  the  hydrogen-bonded  interaction  of  the  phe- 
nolic groups  of  tyrosine  residues  with  the  carboxylate  ion  groups  of  glutamic 
or  aspartic  acid  residues.  Evidence  for  the  presence  of  such  groups  is  often 
based  on  the  spectral  shift  to  longer  wavelengths  observed  for  tyrosine  so 
bonded.  If  this  spectral  shift  is  largely  reversed  in  acid  solution,  and  if  the 
pK  of  this  effect  is  within  the  range  of  that  of  carboxyl  groups,  then  a 
tyrosine-carboxylate  interaction  is  postulated. ^^'^^  In  the  case  of  the  native 
TMV  protein  the  spectral  divergence  from  that  of  an  ad  hoc  prepared  amino 
acid  mixture  has  already  been  pointed  out^  as  well  as  the  correction  of  this 
divergence  by  the  use  of  an  acidic  solvent.  Difference  spectra  have  now 
been  plotted  for  the  protein  at  pH  7  as  compared  to  pH  2-2,  and  these 
were  found  to  resemble  closely  those  obtained  with  ribonuclease,  insulin, 
etc.  (Fig.  1).  Thus  it  appears  probable  that  bonds  of  the  tyrosine-carboxylate 
type  do  occur  in  the  isolated  virus  protein  It  must  be  noted,  however,  that 
the  making  and  breaking  of  these  bonds  appears  readily  reversible  and  that 
they  are  not  stable  under  some,  if  any,  of  the  conditions  required  for  the 
degradation  of  the  virus  prior  to  isolation  of  the  protein.  Thus  it  would 
appear  impossible  to  attribute  to  these  bonds  any  major  role  in  maintaining 
the  original  folding  and  shape  of  the  protein.  Nor  can  they  play  part  in  the 
inter-unit  bonding,  unless  originally  intermolecular  bonds  of  this  type  are 
presumed  to  have  exchanged  partners  during  degradation  of  the  virus  to 
give  the  possibly  weaker  intra-molecular  bonds  observed  in  the  final  prepara- 
tion. A  search  for  spectral  differences  between  the  intact  virus  and  that 
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degraded  by  acid  (+SDS)  require  correction  for  the  light  scattering  by  the 
intact  virus.  Comparison  of  the  spectra  of  virus  which  was  degraded  by 
SDS  either  in  an  acid  solution  or  at  pH  8  need  no  such  correction.  In  both 
cases  a  shift  in  the  opposite  direction  from  that  given  by  the  protein  alone 
was  obtained,  the  acid  solution  giving  the  positive  difference  spectrum  with 
a  maximum  about  290  m/x.  However,  virus  nucleic  acid  alone  was  found  to 
show  a  similar  behavior,  and  its  presence  makes  such  reaction  mixture  too 
complex  to  attempt  any  interpretation  of  the  spectral  changes. 
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Fig.  1.  Ultraviolet  absorption  of  TMV-protein  in  67%  acetic  acid,  HCl  (pH  2-2)  and 
in  water  at  neutrality. 


NATURE    OF    INTER-UNIT    BONDS 

Previous  studies  by  Harrington,  Levy  and  Schachman^*  have  indicated  that 
OH~  ions  are  rapidly  consumed  when  TMV  is  degraded  in  weakly  alkaline 
solutions.  These  studies  have  now  been  extended  to  other  experimental  con- 
ditions leading  to  the  splitting  of  the  virus.  One  type  of  experiment  is  based 
on  the  new  finding  that  the  virus  is  degraded  rapidly  at  room  temperature 
in  1%  SDS  adjusted  to  pH  3-5  with  dilute  HCl.  (Nucleic  acid  of  similar  or 
slightly  higher  infectiousness  as  the  standard  preparations  can  be  isolated 
from  such  reaction  mixtures,  as  well  as  nucleic  acid-free,  though  denatured, 
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protein.)  When  the  reaction  mixture  containing  the  degraded  virus  was  sub- 
sequently titrated  back  to  the  original  pH  (usually  pH  7-0),  more  alkali 
was  required  than  that  equivalent  to  the  acid  used  to  reach  the  pH  at  which 
the  virus  was  degraded.  The  difference  was  similar  to  the  alkali  consump- 
tion observed  in  the  earlier  experiments, 2*  and  corresponded  to  about  1-5 
/xmole  per  18  mg  virus.  Subsequent  titrations  between  pH  7-0  and  3-5  re- 
quired equivalent  amounts  of  acid  and  alkah,  and  followed  the  same  curve 
as  the  first  back  titration  (Fig.  2).  An  alternative  procedure  is  to  perform 


Fig.  2.  Alkali  consumption  during  degradation  of  TMV  by  1  %  sodium  dodecyl  sulfate 
below  pH  4,  as  indicated  by  back  titration  to  original  pH. 


the  customary  degradation  with  SDS  at  about  pH  7-5  and  40°  in  a  pH-stat 
and  note  the  amount  of  alkali  required  to  maintain  a  constant  pH.  Such 
experiments  have  given  an  average  alkali  consumption  of  1-9  /xmole/ 18  mg 
virus  (Fig.  3). 

A  working  hypothesis  which  could  account  for  these  observations  is  as 
follows:  A  certain  number  of  carboxyl  groups  of  the  virus  (probably  2  per 
subunit)  are  hydrogen  bonded  in  the  undissociated  form  to  some  other, 

OH-X— 

y      I 

possibly  tyrosine,  groups  ( — C  =  0- — H  ).  This  bond  is  so  stable  as  to 
prevent  the  carboxyl  groups  from  dissociating  until  disaggregation  of  the 
macro-molecule  re-establishes  their  normal  pK  and  titrability.  Carboxyl- 
tyrosine  bonds  have  been  proposed  by  Edelhoch  for  pepsin, ^^  and  are  in 
that  case  beheved  to  be  stable  up  to' about  pH  6,  at  which  point  pepsin 
becomes  denatured.  In  the  case  of  TMV  one  would  have  to  assume  an 
extension  of  the  pH  stability  range  of  such  groups  to  almost  pH  9.  Inter- 
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actions  between  tyrosine  and  undissociated  carboxyl  groups  have  been  ob- 
served also  in  model  experiments  of  Laskowski,-*'  and  there  are  indications 
that  they  may  produce  much  smaller  spectral  shifts  than  does  the  carboxylate- 
tyrosine  interaction.  However,  such  an  effective  masking  of  protein  carboxyl 
groups  has  not  previously  been  demonstrated  and  will  have  to  be  supported 
by  much  more  experimental  evidence  than  is  available  at  present. 


15 


10 


91  mg  TMVr  BiaM  subunits 

Constant  pH  =  7,5 

T=41° 


+  SDS 


10 


15 


0  5 

MINUTES 

Fig.  3.  Alkali  consumption  during  degradation  of  TMV  in  1  %  sodium  dodecyl  sulfate 
at  a  constant  pH  (7-4). 

A  recent  series  of  observations  appears  to  both  confirm  and  extend  the 
information  concerning  particular  inter-subunit  carboxyl  binding  sites  in  the 
virus.  The  X-ray  diffraction  data  of  Caspar-''  have  shown  that  lead  is  bound 
by  the  virus  stoichiometrically,  and  in  two  distinct  localizations.  Unpub- 
lished observations  from  this  laboratory  have  indicated  that  both  these  sites 
differ  from  the  SH  group  which  is  not  substituted  by  lead.  It  has  also  been 
observed  that  the  addition  of  lead  acetate  (pH  6)  to  virus  at  pH  6  produces 
a  sharp  drop  in  pH  which  is  compensated  by  an  equivalent  amount  of 
alkali.  This  could  be  regarded  as  evidence  that  lead  can  replace  the  two  non- 
dissociated  carboxyl  hydrogen  atoms  of  each  subunit.  Support  for  this  con- 
cept comes  from  the  finding  that  no  alkali  is  consumed  upon  the  subsequent 
addition  of  SDS,  or  at  pH  9  (40°).  The  supposed  replacement  of  the  hydrogen 
bond  by  a  lead  atom  does  not  weaken,  but  rather  strengthens,  the  structure 
of  TMV  as  indicated  by  the  extent  of  splitting  of  protein  and  nucleic  acid 
under  standard  conditions  of  degradation.  Other  divalent  metals  giving  stable 
acetates  (pH  5 -5-6 -5)  such  as  zinc  and  magnesium,  react  similarly  to  lead, 
although  with  decreasing  affinity.  This  was  indicated  by  slower  and  less 
stoichiometric  interaction  (approaching  only  one  site  with  Mg++),  and  less 
stabilization  upon  splitting.  Also  subsequently  added  lead  produced  a  further 
pH  drop,  while  other  metal  salts  added  after  lead  acetate  had  no  such  effect. 
The  particular  affinity  of  lead  for  carboxyl  groups  is  in  accord  with  the 
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finding  of  Gurd  and  Murray,^^  but  this  appears  to  be  the  first  case  where 
lead  and  to  a  lesser  extent  other  divalent  ions  can  enter  into  protein  struc- 
ture by  replacing  a  normally  non-dissociable  carboxyl  hydrogen  atom. 

One  alternative  possibility  which  could  account  for  the  presented  observa- 
tions is  that  about  5000  secondary  phosphate  — OH  groups  ( — R — O — P — 0~), 

/  \ 

ICH] 


rather  than  carboxyl  groups,  prevented  from  dissociation  by  their  involve- 
ment in  structural  stabilization,  might  acquire  their  normal  dissociability 
upon  degradation  of  the  virus.  Since  each  virus  particle  contains  only  about 
10,000  nucleotides,  most  if  not  all  phosphate-diester  linked,  this  explana- 
tion appears  to  be  ruled  out.  Unusual  bonds  as  numerous  as  the  titrations 
indicate  can  only  be  located  in  the  protein.  And  unusual  bonds  of  some 
kind  or  other  will  probably  be  required  to  explain  the  unusual  stability  of 
the  intact  virus  as  compared  with  its  major  component,  the  protein.  Simple 
aggregation,  as  observed  with  so  many  proteins,  does  not  generally  evoke 
major  changes  in  character.  Yet  the  virus  is  stable  at  temperatures  above 
60°,  while  the  isolated  protein  becomes  denatured  below  40°.^^  Differences 
in  resistance  towards  enzymes  and  other  agents  may  be  expressions  of  the 
same  phenomenon.  The  fact  that  the  ease  of  degradation  of  the  virus  differs 
for  different  strains,  as  well  as  for  chemical  derivatives  of  the  same  strain, 
also  points  to  the  existence  of  a  definite  determining  and  limiting  factor  in 
the  inter-unit  bonding.  It  has  also  been  noted  that  the  electrophoretic  mobili- 
ties of  the  virus,  as  well  as  of  various  virus  protein  aggregates,  are  often 
higher,  i.e.,  the  protein  is  less  acidic  than  one  would  expect  from  the  amino 
acid  composition.  An  unexplained  shift  in  the  opposite  direction  was  ob- 
served for  the  enzymatically  dethreoninated  virus.^  It  would  appear  that 
carboxyl-masking  could  be  a  determining  factor  in  the  observed  electro- 
phoretic differences,  as  well  as  in  immunological  and  other  particle  surface 
properties. 

STUDIES    OF    THE    PRIMARY    STRUCTURE    OF    TMV 

During  the  past  years  various  studies  have  been  initiated  by  several  investi- 
gators in  the  University  of  California  Virus  Laboratory  to  solve  structural 
and  peptide  sequence  problems  connected  with  the  TMV  protein.  The 
approach  is  by  now  almost  conventional.  The  steps  involved  are,  (1)  enzy- 
matic degradation  of  the  protein  ;  (2)  separation  of  pure  peptide  fragments  ; 
(3)  elucidation  of  their  composition  and  sequence;  and  (4)  fitting  together 
of  the  pieces  on  the  basis  of  overlaps  supplied  by  the  two  or  more  sets  of 
peptides  obtained  through  the  use  of  two  or  more  different  enzymes. 

(1)  Considerable  effort  has  been  made  to  use  the  SDS-denatured  protein, 
which  is  a  by-product  of  the  nucleic  acid  preparation,  for  these  studies. 
However,  the  susceptibility  of  this  material  to  enzymatic  digestion  has  been 
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found  to  be  somewhat  erratic.  Digestion  of  the  acetic-acid  prepared  protein 
with  1%  trypsin  (Worthington,  cryst.)  at  37°,  i.e.,  near  its  point  of  heat 
d^naturation,  for  about  8  hours  has  given  the  most  reproducible  resuUs. 
The  ninhydrin  analyses  indicate  the  splitting  of  a  maximum  of  about  12 
bonds  (leucine  equivalents).  The  C-terminal  groups  found  in  hydrazinolysis 
experiments  in  the  acid-soluble  fraction  of  the  digest  were,  besides  the 
original  C-terminal  threonine,  in  average  about  9  arginine,  1-3  lysine,  and 
0-25  residues  of  leucine  and  serine.  This  indicates  a  high  degree  of  specificity 
for  the  enzyme  used,  but  falls  slightly  short  of  quantitative  digestion  [ex- 
pected: 11  arginine,  2  lysine  groups^''];  but  the  high  correction  factors,  par- 
ticularly for  arginine,^^  lessen  the  strictly  quantitative  significance  of  these 
values.  The  A^-terminal  groups  found'^''-^^  are  about  one  each  of  aspartic 
and  glutamic  acid,  serine,  threonine,  phenylalanine,  and  tyrosine,  possibly 
two  of  glycine,  and  less  than  one  of  alanine,  valine  and  leucine,  in  general 
quantitative  accord  with  the  sum  of  C-terminal  residues. 

Chymotrypsin  digestion  liberated  about  14  leucine  equivalents.  C-terminal 
residues  were  (besides  the  original  threonine)  phenylalanine  (6  residues), 
leucine  and  isoleucine  (3),  tyrosine  (3),  tryptophan  (about  1)  [total  of  these 
amino  acids:  8,  13+9,  4  and  2^"],  and  traces  (0-2)  of  serine,  glycine,  valine 
and  proline.  The  A^-terminal  residues  found  in  such  digests  were  aspartic 
and  glutamic  acid  (about  4  and  2),  glycine  and  serine  (2),  and  about  one 
each  of  threonine,  valine,  alanine,  arginine,  leucine  and  phenylalanine. 

(2)  For  separation  of  the  peptides  of  TM V  digests  Dowex  50-X-2  columns, 
as  advocated  by  Hirs,  Moore  and  Stein, ^^  have  been  used  with  some  suc- 
cess.^^  Several  sharp  peaks,  however,  could  be  resolved  into  two  or  three 
components  by  subsequent  paper  chromatography.  At  present  Dr  D.  Gish 
is  subjecting  the  digests  to  countercurrent  distribution  and  appears  to  get 
good  resolution  of  the  mixture,  although  here  also  seemingly  homogeneous 
peaks  may  occasionally  contain  more  than  one  component.  A  two-dimensional 
paper  electrophoresis — paper  chromatography  technique  is  being  appHed  to 
the  problem  by  Knight  and  Woody.  ^^  A  reproducible  pattern  of  spots  indi- 
cating about  12  peptides  is  revealed  by  the  ninhydrin  and/or  the  chlorine- 
iodide  test.^^  This  technique  has  been  used  to  survey  a  number  of  virus 
strains  in  comparative  fashion. ^^  Indistinguishable  peptide  patterns  were 
obtained  for  strains  of  very  similar  amino  acid  composition  (masked  strain 
vs.  common  TMV),  while  differences  were  observed  for  strains  showing  dif- 
ferent composition. 

Most  of  the  peptides  obtained  from  similar  digests  by  the  various  tech- 
niques of  isolation  discussed  above  have  been  cross-identified. 

(3)  The  amino  acid  composition  of  most  peptides  has  been  determined 
by  the  Levy  fluorodinitrobenzene  (FDNB)  method^^-^''  after  desalting  the 
preparations  according  to  Hirs  et  al.,^^  or  more  recently  by  the  method  of 
the  Haugaards.^^  The  TV-terminal  amino  acids  were  determined  by  standard 
procedures.^''  The  C-terminal  amino  acid  was  generally  estabhshed  by  the 

Rps 
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enzyme  specificity,  but  was  occasionally  ascertained  by  hydrazinolysis^^-^^ 
or  carboxypeptidase.  Sequences  have  been  determined  in  only  a  limited 
number  of  peptides,  ranging  from  dipeptides  to  hexapeptides.^*'-^^  One  hexa- 
peptide  isolated  from  chymotryptic  digests  as  the  DNP  derivative  appeared 
of  particular  interest  because  it  proved  to  represent  the  C-terminal  peptide 
of  the  protein. ^^  The  same  hexapeptide  was  obtained  from  the  masked  and 
YA  strains,  but  one  of  slightly  different  composition  was  obtained  from 
the  H.R.  strain. 

(4)  The  identified  fragments  are  devoid  of  key  amino  acids  and  too  small 
to  permit  even  a  beginning  of  the  final  stage  of  the  work,  the  fitting  together 
in  proper  sequence.  The  disturbing  possibility  must  also  be  faced  that  all 
peptide  chains  need  not  necessarily  be  identical.  An  appreciable  hetero- 
geneity within  that  population  would  render  our  aim,  as  now  approached, 
unrealizable.  At  present,  one  can  only  be  sure  that  all  or  almost  all  peptide 
chains  have  the  same  C-terminal  sequence.  It  is  also  definitely  established 
and  agreed*"-*^  that  none  or  almost  none  of  the  chains  has  a  free  A^-terminal 
group.  Several  preliminary  notes  by  Braunitzer  and  Schramm  discussing  the 
nature  of  the  'hidden'  A^-terminal  group  and  describing  a  crystalline  A^- 
terminal  trypsin  fragment  of  the  protein  have  not  been  confirmed.  Some 
of  the  claims  and  ideas  were  most  attractive,  but  they  had  to  be  discarded 
for  lack  of  factual  support. 

The  absence  of  an  A^-terminal  group  raises  the  possibility  that  digestion 
of  the  protein  by  enzymes  other  than  trypsin  might  produce  peptides  which 
are  devoid  of  any  basic  group.  A  search  for  such  peptides  was  therefore 
initiated  by  one  of  us  (K.N.)*^  by  passing  various  digests  through  the  acid 
form  of  Dowex-50-X2  columns.  The  same  main  component  of  this  acid 
peptide  fraction  was  isolated  from  either  chymotryptic  or  peptic  digests,  and 
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Fig.  4.  Ultraviolet  absorption  spectrum  of  DNP-derivative  of  acidic  peptide,  and  of 
0-DNP-tyrosine. 
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further  purified  by  Dowex-2-XlO  chromatography.  It  proved  to  contain  only 
two  amino  acids,  having  the  composition  ser,tyr.  Both  hydrazinolysis  and 
carboxypeptidase  proved  that  the  tyrosine  was  C-terminal  which  also  is  in 
accord  with  known  enzyme  specificities.  However,  the  status  of  the  A^- 
terminus  of  this  simple  peptide  represented  a  considerable  mystery.  As  stated, 
its  mode  of  isolation  precluded  the  presence  of  a  basic  group.  Furthermore, 
the  peptide  did  not  give  any  ninhydrin  color,  nor  a  yellow  DNP-derivative, 
although  it  did  give  a  colorless  ether-soluble  derivative  which  showed  the 
typical  spectrum  of  0-DNP  tyrosine  (Fig.  4).  Previous  suggestions  that  the 
missing  A^-terminus  could  be  due  to  a  loop  peptide  had  to  be  abandoned 
in  view  of  the  absence  of  aspartic  or  glutamic  acid.  It  seemed  that  only  the 
presence  of  an  acyl  substituent  on  the  amino  group  could  account  for  the 
observed  properties  of  the  acidic  peptide.  To  search  for  such  a  component, 
the  peptide  was  hydrazinolyzed  and  the  resultant  mixture  of  hydrazides 
(+ tyrosine)  analysed  by  paper  chromatography  in  search  for  any  neutral 
acyl-hydrazides.  As  shown  on  Table  1,  the  main  component,  as  detected 
with  ammoniacal  silver  nitrate,  showed  Rf  values,  in  two  solvents,  which 
were  identical  with  those  of  acetyl-hydrazide. 

Table  1 
Rf   VALUES    OF    SEVERAL    HYDRAZIDES* 


Standard 

Hydiazinolysate 

substance 

of  sample 

0-58 



0-66 

0-66 

0-75 

— 

0-51 

0-51 

0-39 

0-40 

A:  Solvent:  Pyridine -Aniline -Water  (9  : 1  :  4,  by  vol.) 


Formic  hydrazide 
Acetic         „ 
Propionic    „ 
Serine  „ 

Hydrazine 


B:  Solvent:  Collidine -Water  (10  :  2,  by  vol.) 


Formic  hydrazide 
Acetic  „ 

Propionic       „ 
Serine  „ 

Hydrazide 

Decomposition  product  of 
serine  or  its  hydrazide 


Hydrazinolysate 
of  seryl-tyrosine 


*  Detecting  reagent:  Ammoniacal  silver  nitrate  reagent  (0-1  n- 
1  :  1  by  vol.). 


-AgNOg,  0-5  N— NH4OH, 
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Direct  chromatography  of  a  hydrolysate  of  the  acidic  peptide  also  indicated 
the  presence  of  acetic  acid.  It  thus  appears  estabUshed,  that  the  structure 
of  the  acidic  peptide  is  acetyl-ser-tyr.  Thus  acetylation  may  well  be 
responsible  for  the  absence  of  A^-terminal  groups  in  TMV.  The  above  pep- 
tide probably  represents  the  acetyl- A^-terminal  sequence  of  the  virus  protein, 
although  its  occurrence  as  a  branch  chain  is  not  excluded.  This  is  believed 
to  be  the  first  instance*  where  an  acetyl  group  has  been  definitely  shown 
to  be  a  component  of  a  simple  protein,  and  has  been  allocated  to  a  specific 
site  in  that  protein. 

In  conclusion,  then,  TMV  protein  seems  to  be  composed  of  peptide  chains 
of  a  molecular  weight  of  about  18,000  which  instead  of  a  free  amino 
end  terminate  in  acetyl-ser-tyr-  and  which  have  a  thr-gly-ser-pro-ala-thr 
sequence  at  the  carboxyl  end.  About  six  polypeptide  chains  are  united  into 
a  definite  aggregate  (A-protein)  resembling  native  proteins  in  having  masked 
— SH  groups  and  tyrosine-carboxylate  bonds.  The  A-protein  tends  to  aggre- 
gate to  particles  resembhng  in  shape  the  original  virus,  but  lacking  its 
stability. 

It  appears  probable  that  the  — SH  group  (one  per  chain)  is  hydrogen- 
bonded,  playing  the  role  of  an  acceptor,  since  its  H  atom  is  free.  In  the 
degradation  of  the  virus,  about  5000  new  acid  groups  appear  and  become 
titrable  which  may  be  originally  present  as  masked  carboxyl  groups,  hydro- 
gen bonded  in  their  undissociated  form.  To  these  groups  may  be  attributed 
the  remarkable  stability  and  other  surface  properties  which  differentiate  the 
intact  virus  from  the  isolated  protein.  These  same  groups  appear  to  exchange 
their  hydrogen  for  lead  and  with  lesser  affinity  other  divalent  metals. 
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Bei  der  Infektion  einer  Tabakpflanze  mit  der  Protein-freien  Ribonuclein- 
säure  (RNS)  aus  Tabakmosaikvirus  (TMV)  wird  in  der  Zelle  das  spezifische 
Virusprotein  produziert.^  Es  müssen  also  mehr  oder  weniger  direkte  Bezie- 
hungen zwischen  der  Struktur  der  RNS  und  dem  gebildeten  Protein  bestehen. 
Es  ist  anzunehmen,  dass  auch  die  Struktur  anderer  in  Organismen  vorkom- 
mender Proteine  durch  spezifische  Ribonucleinsäuren  bestimmt  wird.  Aber 
nur  im  Falle  des  TMV  gelingt  es,  sowohl  die  RNS  als  auch  das  Protein  leicht 
in  reiner  Form  zu  gewinnen,  so  dass  sich  hier  ein  Beispiel  bietet,  das  von 
allgemeiner  Bedeutung  für  die  Biosynthese  der  Proteine  sein  dürfte.  Leider 
fehlt  es  bisher  noch  an  chemischen  Methoden  zur  Strukturaufklärung  der 
RNS,  so  dass  wir  uns  vorerst  darauf  beschränken  müssen,  den  Proteinanteil 
näher  zu  analysieren. 

Zu  Beginn  der  Strukturuntersuchung  eines  Proteins  erhebt  sich  stets  die 
Frage  nach  der  Einheitlichkeit  des  Untersuchungsmaterials.  Das  TMV  lässt 
sich  verhältnismässig  leicht  von  den  Proteinen  der  Wirtspflanze  abtrennen. 
Wir  prüften  durch  Komplementbindung  die  Reaktion  eines  Antiserums 
gegen  Normalprotein  mit  dem  Virus  und  auch  umgekehrt  die  Reaktion  von 
normalem  Pflanzenprotein  gegen  Antiserum,  das  gegen  unser  Viruspräparat 
hergestellt  worden  war.  In  beiden  Fällen  ergab  sich  übereinstimmend,  dass 
die  auf  dem  üblichen  Wege  (Ammonsulfatfällung,  wiederholte  Ultrazentri- 
fugation)  hergestellten  Viruspräparate  nicht  mehr  als  0,2%  Normalprotein 
enthielten.  2 

Schwieriger  ist  die  Frage  zu  beantworten,  wieweit  die  Virusteilchen  unter 
sich  einheitlich  sind.  Da  beim  TMV  zahlreiche  Mutanten  bekannt  sind,  die 
sich  in  ihren  biologischen  Eigenschaften  unterscheiden,  müssen  wir  annehmen, 
dass  jedes  Viruspräparat  eine  Population  mehr  oder  weniger  nahe  verwandter 
Stämme  darstellt.  Auch  durch  Einzelherd-Isolierung  lassen  sich  nicht  mit 
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Sicherheit  reine  Stämme  gewinnen,  da  bei  der  nachfolgenden  Vermehrung 
des  isolierten  Stammes  die  Möglichkeit  zu  Mutationen  gegeben  ist. 

Gewisse  Versuche  deuten  darauf  hin,  dass  auch  ein  einzelnes  TMV- 
Stäbchen  hinsichtlich  seiner  Struktur  nicht  homogen  ist.  Bei  der  Behandlung 
des  TMV  mit  mildem  Alkali  wird  die  äussere  Proteinhülle  abgestreift  und 
es  entsteht  das  sogennante  A-Protein.  Schramm  und  Mitarbeiter^-^  fanden, 
dass  70%  des  Proteins  verhältnismässig  leicht  von  der  Nucleinsäure  getrennt 
werden  können,  bei  30%  des  Materials  ist  hingegen  die  Bindung  des  Proteins 
an  die  RNS  so  fest,  dass  es  auch  bei  längerer  Behandlung  bei  pH  10,3  nicht 
abgespalten  werden  kann.  Dieser  Anteil  wurde  als  stabile  Fraktion  bezeich- 
net. Die  Ergebnisse  wurden  inzwischen  von  Harrington  und  Schachman^ 
bestätigt.  Die  Autoren  geben  eine  Erklärung,  die  uns  sehr  plausibel  erscheint. 
Danach  wird  von  einem  Ende  her  das  A-Protein  von  dem  Nucleinsäurestrang 
abgestreift,  bis  etwa  2/3  entfernt  sind,  das  Rest-Segment  liefert  die  stabile 
Fraktion.  Die  unterschiedliche  Stabilität  könnte  darauf  beruhen,  dass  die 
Nucleinsäurezusammensetzung  sich  entlang  der  Achse  ändert;  es  können 
aber  auch  Unterschiede  in  den  Peptiduntereinheiten  vorliegen.  Bei  anderen 
Stämmen  des  TMV  findet  sich  ebenfalls  eine  stabile  Fraktion,  der  mengen- 
mässige  Anteil  ist  jedoch  unterschiedlich,  bei  dem  Stamm  Dahlemense  beträgt 
er  nur  12%.  Aus  diesen  Gründen  dürfen  wir  nicht  ohne  weiteres  voraus- 
setzen, dass  wir  vöUig  eindeutige  Resultate  bei  der  Bestimmung  der  Amino- 
säure-Sequenzen erhalten.  Es  erscheint  uns  jedoch  sinnvoll  aufzuklären, 
wieweit  die  Übereinstimmung  geht. 

DIE    GRÖSSE    DER    PEPTID-UNTEREINHEITEN 
BEIM    TMV 

Eine  genaue  Bestimmung  des  Molgewichts  des  TMV  ist  sehr  schwierig.  Für 
die  folgenden  Darlegungen  wird  der  von  Schramm  und  Bergold^  bestimmte 
Wert  von  40.10^  zugrunde  gelegt.  Das  A-Protein,  das  beim  alkalischen 
Abbau  des  TMV  erhalten  wird,  hat  nach  Diffusions-  und  Sedimentations- 
messungen ein  Molgewicht  von  etwa  90  000  ±10%.  Es  handelt  sich  hier 
um  ein  natives  Protein,  das  unter  geeigneten  Bedingungen  wieder  zu  Stäbchen 
von  der  Form  und  Grösse  des  Virus  aggregiert  werden  kann.  Wird  bei  der 
Darstellung  pH  10,6  überschritten,  so  denaturiert  das  A-Protein  und  wird 
unlöslich.  Nach  einer  persönlichen  Mitteilung  von  P.  Newmark^^  lassen  sich 
beim  TMV  auch  native  Untereinheiten  mit  Molgewichten  unter  90  000 
darstellen,  wenn  man  die  Salzkonzentration  sehr  niedrig  hält  und  bis  zur 
äussersten  Grenze  des  erlaubten  pH- Werts  geht.  In  Gegenwart  von  Netz- 
mitteln wurden  von  Schachman  und  Hersh'  Einheiten  erhalten,  die  ein 
Molgewicht  zwischen  10  und  20  000  besassen. 

Eine  genaue  Grösse  der  Peptiduntereinheiten  Hess  sich  durch  Endgruppen- 
Bestimmung  erhalten.  Harris  und  Knight^  fanden,  dass  bei  Einwirkung  von 
Carboxypeptidase  ein  Mol  Threonin  je  17  300  g  Virus  freigesetzt  wird.  Etwa 
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den  gleichen  Wert  fanden  Schramm  und  Braunitzer^  an  dem  von  ihnen 
untersuchten  TMV-Stamm.  Gegen  die  enzymatische  Methode  können  nun 
gewisse  Bedenken  erhoben  werden.  Braunitzer^"'^^  bestimmte  daher  die 
Carboxylendgruppe  auf  chemischem  Wege  mit  der  Hydrazinmethode  von 
Akabori.  Als  einzige  Endgruppe  wurde  ebenfalls  Threonin  und  zwar  in 
einer  Menge  von  1  Mol  je  15  000  g  Virus  festgestellt.  Bei  kurzzeitiger  Ein- 
wirkung von  Hydrazin  (3-4  Stunden  bei  100°C)  gelang  es  auch,  das  Dipeptid 
Ala-Thr  und  das  Tripeptid  Pro-Ala-Thr  in  Form  ihrer  Dinitrophenyl(DNP)- 
Derivate  zu  isolieren.  Nach  Abspaltung  des  Threonins  mit  Carboxypeptidase 
trat  als  einzige  Endgruppe  Alanin  auf  und  zwar  wiederum  in  einer  Menge 
von  ein  Mol  je  16  000  g  Virus.  Zu  etwa  den  gleichen  Ergebnissen  kamen 
unabhängig  hiervon  Niu  und  Fraenkel-Conrat.^^ 

Die  Bestimmung  der  N-terminalen  Konfiguration  in  der  Peptid-Unterein- 
heit  des  TMV  bietet  Schwierigkeiten.  Im  nativen  Virus  sind  keine  end- 
ständigen Aminogruppen  nachweisbar^*-^^.  Nach  Behandlung  des  Virus  mit 
Trichloressigsäure  (pH  1,8,  30  Minuten  bei  85°C)  konnte  jedoch  Prolin  als 
Amino-endständige  Gruppe  nachgewiesen  werden.  Die  Bestimmung  erfolgte 
nach  den  Methoden  von  Sanger^^  und  Edman.^'^  Mit  beiden  Methoden 
wurden  Werte  gefunden,  die  der  Menge  der  Carboxylgruppen  entsprechen. 
Die  Schwierigkeiten  bei  dem  Verfahren  nach  Sanger  liegen  darin,  dass  DNP- 
Prolin  während  der  sauren  Hydrolyse  zu  Dinitrophenol  und  einigen  in 
geringen  Mengen  auftretenden  Nebenprodukten  gespalten  wird.^^  Dieser 
Abbau  kann  teilweise  unterdrückt  werden,  indem  man  die  Hydrolyse  in 
nahezu  wasserfreiem  Medium ^^  durchführt.  Nach  Kontrollversuchen  liefern 
die  anderen  DNP-Aminosäuren  bei  der  sauren  Hydrolyse  kein  Dinitrophenol, 
so  dass  die  Menge  an  endständigem  Prolin  ziemlich  genau  sich  aus  der 
Bestimmung  von  Dinitrophenol +  DNP- Prolin  ergibt. 

UNTERSUCHUNGEN    ÜBER    DEN    EDMANSCHEN 

ABBAU 

Auch  bei  der  Bestimmung  der  Endgruppen  nach  Edman  an  dem  mit  Tri- 
chloressigsäure behandelten  Protein  ergeben  sich  gewisse  Komphkationen. 
Das  Thiohydantoin  des  Prolins  ist  unter  den  Bedingungen  der  CycHsierung 
instabil  und  liefert  in  kleinen  Mengen  ein  Beiprodukt  mit  einem  etwas 
geringeren  Rf-Wert.  Die  quantitative  Auswertung  der  beiden  Flecken  des 
Chromatogramms  ergab  F2400  endständige  Prolin-Moleküle  je  Molekül 
TMV.  Daneben  fanden  wir  in  einer  Menge  von  etwa  20%  noch  andere 
Thiohydantoine,  diese  entsprechen  teilweise  Endgruppen,  die  bei  der  Tri- 
chloressigsäure-Behandlung  im  Innern  der  Peptidketten  freigesetzt  wurden. 
Ausserdem  reagiert  aber  das  Phenylthioisocyanat  (PTI)  mit  gewissen  nicht- 
endständigen  Aminosäuren  unter  Bildung  von  Thiohydantoinen,  wie  wir  an 
reinen  synthetischen  Peptiden  feststellten.  Bei  einstündiger  Reaktionsdauer 
mit  PTI  bei  40°   entstanden   aus    Leu-Gly-Gly   und    Ala-Gly-Gly  3-7% 
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Thiohydantoine  des  nicht-endständigen  Glycins. ^^  Da  eine  Hydrolyse  der 
Peptidkette  unter  diesen  Bedingungen  nicht  erfolgt,  handelt  es  sich  um  einen 
direkten  Angriff  des  PTI  auf  mittelständige  Iminogruppen.  Auch  bei  anderen 
Eivveisstoffen  fanden  wir  stets  neben  den  Banden  der  Endgruppen  in  einer 
Menge  von  2-12%  Nebenbanden,  die  auf  diese  Reaktion  zurück  gehen. 
Auch  Rohrlich  und  Schlüssler^o  fanden,  dass  Gliadin  aus  Weizen  in  kleiner 
Menge  Thiohydantoine  des  Glycins  und  Alanins  liefert,  selbst  wenn  die 
Endgruppen  vorher  mit  dem  DNP-Rest  substituiert  wurden.  In  der  fol- 
genden Tabelle  1  sind  die  Ergebnisse  der  Endgruppenbestimmung  am  TMV 

Tabelle  1 

ENDGRUPPENBESTIMMUNG    NACH    EDMAN  BEI 
VERSCHIEDENEN   PROTEINEN 


Protein 

Endgruppen/Mol 

Nebenbanden  in  %  der 
Gesamtextinktion 

TMV* 

Pro 

2300 

20 

Hämoglobin 
Rind 
Pferd 
Mensch 

Val+Meth 

Val 

Val 

3,9 
3,8 
3,8 

12 
8 
2 

Lysozym 

Lys 

0,9 

10 

und  einigen  anderen  Eiweisstoflfen  zusammengefasst.  Es  ist  darauf  hinzu- 
weisen, dass  die  Ergebnisse  beim  Rinder-Globulin  und  beim  Lysozym  sehr 
gut  mit  den  in  der  Literatur  angegebenen  Werten  übereinstimmen.  Abwei- 
chend von  Porter  und  Sanger^^  fanden  wir  jedoch  beim  Hämoglobin  des 
Pferdes  und  des  Menschen  die  gleiche  Anzahl  von  Endgruppen  wie  beim 
Rind,  was  inzwischen  von  anderer  Seite^^  bestätigt  wurde. 

Durch  Verwendung  substituierter  Phenylthioisocyanate  konnte  die  Neben- 
reaktion teilweise  unterdrückt  werden.  Beim  Abbau  von  Ala-Gly-Gly  und 
Leu-Gly-Gly  mit  ortho-Nitrophenylthioisocyanat  entsteht  in  der  ersten  Stufe 
kein  Glycin-Thiohydantoin,  während  die  Reaktion  der  terminalen  Gruppe 
schneller  und  vollständiger  verläuft  als  mit  Phenylthioisocyanat.  Jedoch  ist 
auch  die  Reaktionsgeschwindigkeit  mit  Wasser  erhöht,  so  dass  sich  grosse 
Mengen  an  Harnstoff-Derivaten  bilden,  die  die  Chromatographie  stören^^ 
(vergleiche  Tabelle  2).  Wir  halten  daher  das  Phenylthioisocyanat  für  das 
geeignetste  Reagenz  für  den  Edmanschen  Abbau.  Im  allgemeinen  ist  die 
Menge  der  Nebenprodukte  so  klein,  dass  die  Identifizierung  der  Endgruppen 
nicht  erschwert  wird  und  die  grosse  Bedeutung  des  Edmanschen  Abbaus, 

*  Mit  Trichloressigsäure  behandelt. 
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den  wir  ständig  für  unsere  Untersuchungen  benützen,  hierdurch  nicht  be- 
einträchtigt wird. 

Tabelle  2 

REAKTIONSFÄHIGKEIT   DER   PHENYLTHIOISOCYANATE 

Die  verschiedenen  substituierten  Thioisocyanate  worden  mit  Leucin  umgesetzt  und  die 
Menge  an  Phenyl thiohy dan toin  des  Leucins  (PTH-Leu)  und  an  Nebenprodukten  chromato- 
graphisch bestimmt. 


Gesamtausbeute  in 

PTH-Leu-Deriv. 

Nebenbanden 

Thioisocyanate 

Mol  %  bez.  auf  Leucin 

in  Mol  % 

in  Mol  %  bez. 
auf  PTH-Leu 

Phenyl- 

90 

82 

8 

o-Nitrophenyl 

100 

80 

20 

m-Nitrophenyl 

120 

80 

40 

p-Nitrophenyl 

120 

70 

50 

o-Äthoxyphenyl 

45 

33 

12 

p-Äthoxyphenyl 

83 

62 

21 

p-Methoxyphenyl 

78 

53 

25 

o-Chlorphenyl 

70 

58 

12 

2,5-Dichlorphenyl 

67 

54 

13 

p-Azobenzolphenyl 

20 

14 

6 

VERSUCHE    ZUR    BESTIMMUNG   DER    N-TERMINALEN 
KONFIGURATION   IM    NATIVEN   TMV-PROTEIN 

Braunitzer^*  versuchte  aufzuklären,  warum  im  nativen  TMV  die  ProHn- 
Endgruppe  nicht  erfassbar  ist.  Es  ist  bekannt,  dass  Amide  des  y-Carboxyls 
der  Glutaminsäure  und  des  jS-Carboxyls  der  Asparaginsäure  durch  Säuren 
sehr  viel  leichter  gespalten  werden  als  a-Peptidbindungen.  Es  war  daher 
wahrscheinlich,  dass  die  Blockierung  auf  einer  derartigen  ß  bzw.  y-Peptid- 
bindung  der  Seitenkette  beruht  und  diese  durch  Trichloressigsäure  gespalten 
werden.  Derartige  Peptidbindungen  werden  auch  durch  Hydroxylamin  leicht 
unter  Bildung  der  entsprechenden  Hydroxamsäuren  gelöst. ^^  In  der  Tat 
wurden  bei  Behandlung  von  TMV  mit  3n  NH2OH — Lösung  bei  pH  7  (24 
Stunden  bei  60°C)  50-80%  der  theoretischen  Menge  an  endständigem  Prolin 
in  Freiheit  gesetzt,  ohne  dass  eine  Spaltung  von  a-Peptidbindungen  in  grös- 
serem Ausmass  bemerkbar  war.  Einige  weitere  Versuche,  die  jedoch  noch 
nicht  abgeschlossen  sind,  sprechen  dafür,  dass  die  Blockierung  durch  die 
/3-Carboxylgruppe  des  Asparaginsäurerestes  zustande  kommt. 

Die  Ergebnisse  der  nach  vier  verschiedenen  Methoden  durchgeführten 
Endgruppenbestimmung  sind  in  Tabelle  3  nochmals  zusammengefasst.  Die 
Existenz  definierter  Peptid-Untereinheiten,  die  anfangs  von  der  Arbeits- 
gruppe in  Berkeley  stark  angezweifelt  wurde,  dürfte  damit  gesichert  sein. 
Als  wahrscheinlichster  Wert  für  das  Molgewicht  dieser  Untereinheiten  ist 
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Tabelle  3 
ZAHL    DER    ENDGRUPPEN    BEIM    TABAKMOSAIKVIRUS 
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Material 

Methode 

Endgruppen 
Mol/40- 106g  TMV 
C-terminal     N-terminal 

Autor 
(Ref.) 

natives  TMV 

Carboxypeptidase 
Hydrazinolyse 

Thr 
Thr 

2300 
2700 

8,9 
10-12 

TMV  beh.  mit 
Trichloressigsäure 

Sanger 
Edman 

Pro 
Pro 

2300 
2300 

14 
19 

16  500  anzusehen.  Dieser  steht  in  bester  Übereinstimmung  mit  den  von 
Knight^^  durchgeführten  Aminosäure-Analysen.  Wenn  man  annimmt,  dass 
jede  Kette  ein  Molekül  Cystein  enthält,  ergibt  sich  ein  Molgewicht  von 
16  300  und  eine  Zahl  der  Aminosäuren  von  circa  150.  Dieses  Molekularge- 
v^icht  stimmt  auch  gut  mit  der  Grösse  der  Untereinheiten  überein,  die  R. 
Franklin  aus  Röntgenuntersuchungen  erhielt.  ^^  Nach  elektronenmikro- 
skopischen Messungen^^  beträgt  die  Länge  des  TMV-Stäbchens  2980  ±10 
Â.  Auf  eine  Länge  von  69  Â  kommen  nach  Franklin  49  Untereinheiten,  auf 
das  ganze  Stäbchen  also  2  100.  Bei  der  Unsicherheit,  mit  der  Längenmessung 
und  Molgewichtsbestimmungen  beim  TMV  behaftet  sind,  ist  diese  Überein- 
stimmung als  befriedigend  anzusehen. 


BESTIMMUNG    DER    AMINOSÄURE-SEQUENZ    IN 
DER    PEPTIDUNTEREINHEIT 

Die  Bestimmung  der  Sequenz  der  Aminosäuren  innerhalb  der  Peptidunterein- 
heit  steht  in  unserem  Laboratorium  erst  am  Anfang,  ich  möchte  mich  daher 
darauf  beschränken,  die  allgemeinen  Methoden  zu  beschreiben,  die  wir 
hierbei  verwenden.  Im  Nucleinsäure-freien  Protein  des  Virus  wurde  zunächst 
durch  Behandlung  mit  Trichloressigsäure  das  N-terminale  Prolin  in  Freiheit 
gesetzt  und  anschliessend  das  Protein  mit  Dinitrofluorbenzol  umgesetzt- 
Hierbei  enstehen  in  der  Peptidkette  drei  chromophore  Gruppen,  eine  des 
endständigen  DNP-Prolins  und  die  beiden  mittelständigen  e-DNP-Lysylreste. 

DNP  DNP 


DNP— Pro- 


-Lys- 


-Lys- 


-Thr 


Durch  anschliessende  enzymatische  Spaltung  des  substituierten  Proteins 
wurde  zunächst  versucht,  die  Reihenfolge  der  Aminosäuren  in  der  Nach- 
barschaft der  drei  chromophoren  Gruppen  festzulegen,  um  hierdurch  3 
Fixpunkte  zu  schaffen,  die  die  gegenseitige  Zuordnung  der  übrigen  Spalt- 
peptide  erleichtern.^^  Die  enzymatische  Spaltung  des  substituierten  Proteins 
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erfolgte  mit  Chymotrypsin,  Subtilisin  und  einem  Proteinasengemisch  aus 
Aspergillus  oryzae.  Das  letztere  bewährte  sich  vor  allem  zur  Herstellung 
kürzerer  Spaltstücke.  Pepsin  eignet  sich  weniger,  da  das  DNP-Protein  im 
sauren  Bereich  zu  wenig  löslich  ist. 

Erwartungsgemäss  erhielten  wir  drei  Gruppen  gelber  Peptide,  die  entweder 
DNP-Prolin  oder  e-DNP-Lysin  enthielten.  Die  Trennung  erwies  sich  zum 
Teil  als  ausserordentlich  schwierig.  Wie  Sanger^^  zeigte,  treten  bei  der  enzy- 
matischen  Spaltung  von  Polypeptiden  häufig  auch  Pyrrolidoncarboxylpeptide 
auf,  die  in  Essigester  leicht  löslich  sind  und  dadurch  die  Trennung  der  DNP- 
Peptide  erschweren.  Die  farblosen  Pyrrolidoncarboxylpeptide  lassen  sich 
jedoch  leicht  durch  die  Fluoreszenz-Auslöschung  im  UV-Licht  in  den 
Chromatogrammen  erkennen.  Als  Trennungsmethoden  benützten  wir:  1.) 
Verteilungschromatographie  an  Kieselgursäulen,  wobei  wir  als  mobile  Phase 
hauptsächlich  Essigester-Butanol-Gemische  und  als  stationäre  Pufferlösungen 
mit  verschiedenem  pH  verwandten.  2.)  Hochspannungselektrophorese.  Wir 
arbeiteten  meist  bei  Spannungen  von  5-  10  000  V  in  der  von  MichP"  ange- 
gebenen Anordnung  in  einem  Toluolbad,  das  durch  Kühlung  auf  etwa  10°C 
gehalten  wurde.  3.)  Papierchromatographie,  wobei  wir  besonders  Pyridin- 
haltige  Lösungen  und  reine  Puffer,  z.B.  2rii  Natriumacetat  benützten. 

Bei  der  Konstitutionsermittlung  der  erhaltenen  DNP-Peptide  bewährte  sich 
neben  den  bekannten  Methoden  auch  das  von  Justisz  und  Ritschard^^ 
beschriebene  Verfahren,  bei  dem  zunächst  der  DNP-Rest  durch  Hydrierung 
zu  einem  Dihydrochinoxalon  abgespalten  wird. 

Das  Restpeptid  kann  entweder  durch  Wiederholung  der  Operation  oder 
nach  Edman  weiter  abgebaut  werden.  Ein  ähnliches  Verfahren  wurde  unab- 
hängig von  Uhlig^^  ausgearbeitet.  Die  Cyclisierung  tritt  bereits  während  der 
Hydrierung  in  Eisessig  oder  in  In  HCl-Methanol  mit  Pt  oder  Pd  ein.  Bei 
der  Umsetzung  des  Reaktionsgemisches  mit  Dinitrofluorbenzol  liefern  die 
Chinoxalone  Bis-DNP-Derivate,  die  näher  charakterisiert  wurden.  Sie  sind 
in  Chloroform  oder  Eisessig  leicht  löslich  und  können  chromatographisch 
identifiziert  werden. 

Zu  Beginn  unserer  Untersuchungen  wurde  bei  der  Spaltung  mit  Chymo- 
trypsin haltigem  Trypsin  ein  kristallisiertes  DNP-Prolylpeptid  isoliert,  das 
bei  der  Analyse  sehr  konstante  Werte  Heferte  und  von  dem  wir  annahmen, 
dass  es  aus  21  Aminosäuren  besteht.  Mit  den  verbesserten  Trennmethoden 
erwies  sich  jedoch  dieses  KristaUisat  als  nicht  einheitlich.  Es  gelang  schliess- 
lich, eine  Reihe  von  DNP-Prolylpeptiden  in  reiner  Form  zu  isoUeren,  von 
denen  sich  die  meisten  der  Sequenz 

DNP-Pro-Ileu-Glu-(Glu,Leu4)-Glu-Leu 
zuordnen  Hessen.  Daneben  fanden  wir  aber  auch  ein  Peptid,  dem  die  Sequenz 

DNP-Pro-Leu-Val 

zukommt  und  das  mit  der  eben  angeführten  Reihenfolge  nicht  zu  verein- 
baren ist.  Beide  Sequenzen  wurden  auch  nebeneinander  bei  der  partiellen 
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Säurehydrolyse  aufgefunden.  Das  Auftreten  dieser  unterschiedlichen  Sequen- 
zen kann  einmal  darauf  beruhen,  dass  das  benützte  TMV-Material  genetisch 
nicht  einheitlich  war.  Es  wäre  aber  auch  möglich,  dass  in  einer  und  derselben 
TMV-Partikel  Polypeptidketten  mit  verschiedenen  Sequenzen  vorkommen. 
Eine  dritte  Möglichkeit  wäre,  dass  eine  Prolyl-Sequenz  aus  der  Mitte  der 
Kette  stammt  und  durch  die  Behandlung  mit  Trichloressigsäure  freigelegt 
wurde.  Da  eine  quantitative  Bestimmung  der  Ausbeuten  an  den  beiden 
Peptidsequenzen  wegen  des  komplizierten  Trennungsganges  nicht  möglich 
war,  kann  die  Entscheidung  zwischen  diesen  Möglichkeiten  zur  Zeit  noch 
nicht  getroffen  werden.  Die  Peptide,  die  einen  DNP-Lysylrest  enthalten, 
lassen  sich  zu  zwei  verschiedenen  Sequenzen  zusammenstellen  : 

Ala-GluNH2-Lys-Pro-Val-(Leu,Phe)  und 
Lys-Pro-Ser-GluNH2-(Gly,Ser-Thr) 

Es  gelten  jedoch  hier  die  gleichen  Einschränkungen  wie  für  das  Prolylpeptid. 
Es  besteht  ebenfalls  die  Möglichkeit,  dass  die  beiden  Lysyl-Sequenzen  nicht 
aus  derselben  Polypeptidkette  stammen.  Zur  Zeit  sind  Untersuchungen  im 
Gang,  durch  Spaltung  am  nicht  substituierten  TMV-Protein  Polypeptide  zu 
isolieren,  um  zu  sehen,  ob  sich  eine  lückenlose  Verbindung  zwischen  den 
beiden  Lysylpeptiden  herstellen  lässt. 

Bei  diesen  Untersuchungen  hoffen  wir  auch  weitere  Aufschlüsse  über  die 
N-terminale  Konfiguration  am  nativen  Virus  zu  erhalten. 
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X-ray  diffraction  studies  of  the  structure 
of  the  protein  of  tobacco  mosaic  virus 

ROSALIND  E.  FRANKLIN 

Crystallographic  Laboratory,  Birkbeck  College, 
University  of  London 


A  systematic  application  of  the  methods  of  X-ray  diffraction  to  the  study 
of  the  structure  of  tobacco  mosaic  virus  (TMV)  has  provided  some  informa- 
tion concerning  the  general  structural  characteristics  of  the  protein  of  the  virus. 

Well -orientated  specimens  of  concentrated  TMV  solutions  can  be  obtained 
in  fine  glass  capillaries,  and  give  excellent  X-ray  fibre-diagrams  (Bernai  and 
Fankuchen,  1941;  Franklin,  1955«).  Fibre-diagrams  which  are  similar  but 
significantly  different  may  be  obtained  from  heavy-atom  derivatives  of  TMV 
(Caspar,  1956;  Franklin  and  Holmes,  1956),  from  the  repolymerized  nucleic 
acid-free  virus  protein  (Franklin,  19556)  and  from  different  strains  of  the 
virus  (Franklin,  1956a).  Structure  analysis  is  based  on  a  detailed  quantita- 
tive comparison  of  the  X-ray  diagrams  obtained  from  all  these  substances. 
The  analysis  so  far  carried  out  is  based  on  only  a  small  part  of  the  X-ray 
diagram,  and  the  work  is  therefore  still  at  a  relatively  early  stage. 

It  has  been  shown  that  the  virus  protein  is  in  the  form  of  structurally 
equivalent  sub-units  set  in  helical  array  about  the  particle  axis  (Watson, 
1954;  Franklin,  1955a)  and  (Franklin,  Klug  and  Holmes,  1956)  that  these 
structural  sub-units  may  be  identified  with  the  chemical  sub-units  established 
by  amino  acid  and  end  group  analysis  (e.g.  Harris  and  Knight,  1952,  1955; 
Schramm  and  Braunitzer,  1953;  Niu  and  Fraenkel-Conrat,  1955). 

The  protein  sub-units  lie  on  a  helix  of  pitch  23  Â,  and  there  are  49  sub- 
units  to  every  3  turns  of  the  helix  (Franklin  and  Holmes,  1956).  A  large 
part  of  the  results  so  far  obtained  is  concerned  with  the  shape  of  these  sub- 
units.  If  the  TMV  particle  were  a  simple  uniform-density  rod,  of  circular 
cross-section,  the  simplest  possible  shape  of  the  protein  sub-units  would  be 
one  resembling  a  slice  of  cake.  We  find,  however,  that  the  density  of  the 
TMV  particle  is  far  from  uniform,  and  that  the  shape  of  the  protein  sub-unit 
is  not  simple.  The  rod-shaped  particle  has  a  hollow  core,  of  radius  about 
20  Ä  (Caspar,  1956;  Franklin,  Klug  and  Holmes,  1956)  and  the  shape  of 
each  sub-unit  is  such  that  it  protrudes  at  both  the  inner  and  outer  surfaces. 
The  helical  array  of  protuberances  on  the  outer  surface  has  already  been 
described  (Frankhn  and  Klug,  1956).  That  on  the  inner  surface  has  been 
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made  apparent  by  recent  work  in  which  a  helical  projection  of  the  virus  struc- 
ture was  calculated  from  X-ray  data.  This  projection  (which  will  be  de- 
scribed in  detail  elsewhere)  shows  that  the  internal  and  external  protuberances 
do  not  occur  opposite  to  one  another,  thus  suggesting  that  the  general  lie 
of  the  protein  sub-units  is  not  strictly  perpendicular  to  the  particle  axis,  but 
somewhat  skew  to  it. 

Independent  evidence  from  three  different  parts  of  the  X-ray  diagram 
indicates  that  the  shape  of  the  protein  sub-units  is  such  that  their  helical 
packing  results  in  a  set  of  holes  in  the  virus  particle  at  a  radial  distance  of 
55-60  Â  from  the  particle  axis.  The  mean  radius  of  the  particle  is  about 
75  Â,  and  the  maximum  radius  about  90  Â.  The  nucleic  acid  lies  at  a  radius 
of  40  Â  (Franklin,  19566).  The  X-ray  results  therefore  suggest  that  the 
nucleic  acid  might  well  be  more  accessible  to  reagents  via  the  external  groove 
and  the  55-60  Â  holes  than  via  the  hollow  central  core  of  the  particle. 

The  X-ray  results,  considered  in  conjunction  with  the  ratio  of  nucleic 
acid  to  protein,  make  it  appear  probable  (though  not  yet  certain)  that  the 
nucleic  acid  is  in  the  form  of  a  single  molecular  chain  following  a  rather  flat 
helical  path,  and  fitting  into  the  protein  structure  in  a  rather  compact  manner. 
The  shape  of  the  protein  sub-units  must  therefore  be  such  as  to  allow  this 
RNA  molecular  chain  to  pass  between  adjacent  sub-unit  layers. 

Thus,  although  it  is  not  yet  possible  to  define  the  shape  of  the  protein 
sub-units,  we  know  that  it  must  be  such  as  to  favour  their  helical  packing 
and  yet  to  leave  holes  between  neighbouring  sub-units  at  a  distance  of 
55-60  Â  from  the  particle  axis,  to  leave  a  hollow  core  of  radius  about  20  Â 
in  the  particle,  to  provide  some  kind  of  gap  through  which  the  RNA  mole- 
cule may  pass  at  a  radius  of  40  Â  and  to  give  a  protuberance  at  both  the 
inner  and  outer  surfaces  of  the  particle. 

Our  X-ray  diffraction  results  also  indicate  that  the  direction  of  the  poly- 
peptide chains  within  the  sub-units  tends  to  be  perpendicular  to  the  particle 
axis,  and  in  a  tangential  rather  than  a  radial  direction. 

It  may  be  of  interest  to  compare  the  information  obtainable  from  X-ray 
fibre-diagrams  of  TMV  and  from  X-ray  diagrams  of  single  crystals  of  globular 
proteins.  Because  of  the  strongly  uniaxial  character  of  the  TMV  particle 
and  also,  in  part,  because  of  its  great  stability,  a  limited  amount  of  informa- 
tion concerning  the  inner  structure  of  the  particle  can  be  obtained  with 
relatively  little  effort.  By  a  much  more  laborious  quantitative  analysis  of 
X-ray  fibre-diagrams  of  TMV  and  its  heavy-atom  derivatives,  analogous  to 
that  being  carried  out  by  Perutz,  Kendrew  and  co-workers  on  globular  pro- 
teins, it  should  be  possible  to  determine  in  some  detail  not  only  the  shape 
of  the  particle  and  of  its  sub-units,  but  also  the  direction  of  the  back-bone 
chains  of  both  the  protein  and  nucleic  acid  components.  Some  progress 
has  already  been  made  in  this  direction.  On  the  other  hand,  the  three- 
dimensional  diffraction  picture  of  TMV,  which  must  be  derived  from  fibre- 
diagrams  only,  will  never  be  as  accurate  as  that  obtained  from  single  crystals 
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of  protein,  and  the  resulting  structural  information  will  therefore  never  be 
as  detailed.  We  cannot  hope,  in  TMV,  to  find  the  positions  of  atoms  by 
X-ray  methods.  Thus,  while  some  general  structural  characteristics  of  the  pro- 
tein of  TMV  can  be  determined  much  more  easily  than  in  the  simpler  struc- 
tures of  crystals  of  globular  proteins,  the  detailed  configuration  of  the 
polypeptide  chains  of  globular  proteins  must  be  studied  in  single  crystals. 
It  is  hoped,  however,  to  find  out  enough  about  the  structure  of  the  protein 
(and  also  of  the  nucleic  acid)  of  TMV  to  give  some  indication  of  the  nature 
of  the  nucleoprotein  complex  in  the  intact  virus. 
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Avant  de  parler  des  résultats  nouvellement  acquis  concernant  la  structure 
chimique  du  lysozyme,  résultats  qui  font  l'objet  essentiel  de  cet  exposé,  il 
paraît  utile  de  rappeler  brièvement  quelques-unes  des  données  précédemment 
établies,  ou  proposées,  se  rapportant  à  la  structure  de  la  protéine  en  ques- 
tion. D'après  toute  une  série  de  déterminations^-^  effectuées  soit  par  voie 
chimique  et  notamment  par  chromatographic,  soit  par  voie  microbiologique, 
le  lysozyme  d'oeuf  de  poule  présente  une  composition  en  acides  aminés 
correspondant  approximativement  à  la  formule  brute  suivante  : 

Glyii  Alaio  Ser9(Cys-)io  Meta  Thr,  Pro2  Valg  Leug  Ileug  Phcg 
Tyra  Trya  Aspao  Glug  Lysg  Argu  HiSi  (NH2)i8  HgO 

le  poids  moléculaire  étant  d'environ  14.800. 

La  recherche  des  acides  aminés  N-terminaux,  recherche  effectuée  soit  par 
la  méthode  de  Sanger,  soit  par  la  méthode  d'Edman,  a  montré  de  façon 
indiscutable  que  le  lysozyme  ne  possédait  qu'un  seul  acide  aminé  N-terminal, 
la  lysine.^-^  D'autre  part,  la  recherche  des  acides  aminés  C-terminaux,  re- 
cherche effectuée  soit  par  l'utilisation  de  la  carboxypeptidase,^'^  soit  par 
la  méthode  de  réduction  par  AlLiH4,'  a  montré  de  façon  non  moins  indis- 
cutable que  le  lysozyme  ne  possédait  qu'un  seul  acide  aminé  C-terminal,  la 
leucine.  Le  lysozyme  d'oeuf  de  poule  est  donc,  selon  toute  vraisemblance, 
constitué  par  une  seule  chaîne  peptidique  non  ramifiée.  En  outre,  il  apparaît 
que  cette  chaîne  ne  contient  aucun  groupe  — SH.^  Les  cinq  résidus  de  cystine 
correspondent  donc  à  cinq  ponts  — S — S —  qui  doivent  former  dans  la 
molécule  autant  de  boucles.  On  sait  également  que  l'ouverture  des  ponts 
• — s — s —  et  leur  remplacement  par  des  groupes  — S.CH2.COOH  ne  pro- 
voque aucune  scission  de  la  molécule,^  ce  qui  confirme  le  caractère  uni- 
caténaire  de  cette  dernière. 

La  formule  brute  qui  vient  d'être  donnée  présente  une  utilité  évidente 
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comme  guide  pour  les  recherches  concernant  la  structure  de  la  molécule; 
mais  elle  ne  peut  être  considérée  comme  rigoureusement  exacte:  la  formule 
définitive  ne  pourra  être  établie,  sans  aucune  ambiguïté,  qu'après  que  la 
structure  chimique  complète,  ou  tout  au  moins  l'ordre  complet  d'enchaîne- 
ment de  tous  les  acides  aminés  constitutifs,  aura  été  déterminé. 

Cette  formule  montre  néanmoins  déjà  quelques  particularités  intéressantes 
ici  :  la  molécule  de  lysozyme  renferme  seulement  un  résidu  d'histidine,  deux 
de  proline,  deux  de  methionine,  trois  de  phenylalanine  et  trois  de  tyrosine. 
Ces  résidus  d'acides  aminés,  assez  caractéristiques,  peuvent  donc  être  uti- 
lisés avec  avantage  comme  marqueurs  des  fragments  de  la  protéine  qui 
les  contiennent;  en  outre,  la  lysine  N-terminale  et  la  leucine  C-terminale 
peuvent  également  être  considérées  comme  marqueurs.  L'isolement  des  pep- 
tides correspondant  à  ces  fragments  et  leur  caractérisation  apparaissent  donc 
comme  un  stade  en  quelque  sorte  préliminaire,  indispensable  à  l'étude  de 
la  structure  chimique  du  lysozyme. 

Cette  formule  montre,  d'autre  part,  que  le  lysozyme  contient  un  nombre 
relativement  élevé  de  résidus  de  tryptophane.  C'est  là  un  caractère  qui,  du  point 
de  vue  qui  nous  occupe,  différencie  cette  protéine  des  autres  protéines  telles 
que  l'insuline,  la  ribonucléase  ou  autres,  dont  la  structure  chimique  a  déjà 
été  établie  ou  est  en  passe  de  l'être  et  qui,  soit  ne  renferment  pas  cet  acide 
aminé,  soit  n'en  contiennent  qu'un  résidu.  La  présence  de  tryptophane  est 
en  effet  une  cause  de  difficultés  supplémentaires,  qui  seront  considérées 
plus  loin. 

Au  cours  d'une  première  série  d'investigations,  faites  à  notre  laboratoire, 
un  certain  nombre  de  peptides  marqués  par  des  résidus  d'acides  aminés  peu 
fréquents  ont  pu  être  isolés.  Ces  peptides  ont  été  obtenus  en  soumettant  le 
lysozyme  à  divers  procédés  d'hydrolyse:  hydrolyse  partielle  acide,  hydro- 
lyse chymotrypsique,  hydrolyse  pepsique  ;  leur  isolement  a  été  fait  essentielle- 
ment à  l'aide  de  la  chromatographic  sur  papier  et  de  l'ionophorèse  sur 
papier.  Ils  ont  néanmoins  permis  d'établir  les  enchaînements  présentés  dans 
le  tableau  1. 

On  voit  que  ces  enchaînements  permettent  déjà  de  caractériser  l'entourage 
de  l'unique  résidu  d'histidine,  et  les  entourages  respectifs  de  chacun  des  trois 
résidus  de  phenylalanine  et  des  trois  résidus  de  tyrosine,  mais  seulement  l'en- 
tourage complet  d'un  seul  résidu  de  proline.  Parmi  les  enchaînements  com- 
portant un  résidu  de  phenylalanine,  l'enchaînement  Lys.Val.Phe.Gly.Arg, 
complétant  l'enchaînement  Lys.Val.Phe.Gly  obtenu  antérieurement  par 
Schroeder,^  est  particulièrement  intéressent,  puisque  c'est  l'enchaînement  de 
l'extrémité  N-terminale  de  la  protéine.  En  outre,  quelques  peptides  contenant 
du  tryptophane  ont  été  caractérisés  (tableau  2). 

D'autre  part,  A.  R.  Thompson^^  a  obtenu  par  hydrolyse  acide  ménagée 
un  assez  grand  nombre  de  peptides;  ceux-ci  ont  été  séparés  par  chromato- 
graphic sur  Dowex  50  x  4,  puis  purifiés  par  chromatographic  sur  papier. 
L'auteur  australien,  par  des  tentatives  de  recombinaison,  en  a  conclu  à 
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Tableau  1 

PEPTIDES    'MARQUÉS'    PAR    HIS,    TYR, 

PHE,    PRO    OU    MET,    ISOLÉS    SUR 

PAPIER    ET    RECONSTITUÉS    APRÈS 

HYDROLYSE    ACIDE,    TRYPSIQUE, 

PEPSIQUE    OU    CHYMOTRYPSIQUE 


279 


1.  Arg.His.Lys* 

2.  Asp.Tyr.Arg.Glyio 

3.  Arg.Gly.Tyr.Ueu.Leuio 

4.  Asp(NH2).Ala.Tyr.Gly.Ser.Leu.Asp(NH2)io 

5.  Lys.Val.Phe.GIy.Arg4 

6.  Glu.Ser.Phe.Aspio 

7.  Ala.Lys.Phe.Gluio 

8.  Leu.Proio 

9.  Thr.Pro.Gly.Ser.Arg" 

10.  AIa.AIa.Metio 


Tableau  2 

PEPTIDES     CONTENANT    DU    TRYPTOPHANE, 
ISOLÉS    PAR    CHROMATOGRAPHIE   SUR   PAPIER 
APRÈS    HYDROLYSE    CHYMOTRYPSIQUE12 


1.  Asp.Ala.Try 

2.  Val.AIa.Try 

3.  (Gly,Arg).Try 

4.  (Gly,Ala,Try,Arg).Leu 

Tableau  3 
ENCHAÎNEMENTS    PROPOSÉS   PAR   A.   R.   THOMPSON" 


1.  Thr.Asp.Val.Glu.Ala 

14.  Thr. Asp. Arg. Arg 

2.  Ileu.Glu.Leu.Ala.Leu 

15.  Thr.Gly.Asp.Val 

3.  Asp.Glu.Ala 

16.  Ser.Val.Cys.Ala.Lys.GIy 

4.  Leu.Thr.Ala. 

17.  Gly.Cys.Asp 

5.  Glu.Asp.Ileu 

18.  Leu.Gly.Ala.Val 

6.  Thr.Glu.AIa.Gly 

19.  Asp.Ileu.Pro.Cys 

7.  Ser.Asp.Gly.Met.Asp 

20.  Arg.Cys.Lys.Gly 

8.  Asp.Ala.Met.Lys.Cys.Arg 

21.  Ser.Val.Asp.Cys.Ala 

9.  Val.Thr.Pro.Gly.Ala 

22.  Asp.Leu.Cys.Asp 

10.  Ser.Asp.Arg 

23.  Arg.Asp.Cys.Ileu 

11.  Lys.Phe.Glu.Gly 

24.  Ser.Arg.Leu 

12.  Arg.Cys.GIu.Ala 

25.  Ser.Asp.Cys.Arg.Leu 

13.  Ser.Phe.Asp.GIu 
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l'existence,  dans  le  lysozyme,  des  enchaînements  suivants  (tableau  3),  Mal- 
heureusement, certains  de  ces  enchaînements  présentent  encore  un  caractère 
quelque  peu  hypothétique. 

D'autre  part,  enfin,  Ohno,^*  après  avoir  soumis  le  lysozyme  à  une  hydra- 
zinolyse  ménagée,  a  été  conduit  à  proposer  comme  enchaînement  C-terminal 
de  la  protéine:  Asp.Gly.Ala.Asp,(NH2).Leu.  On  verra  cependant  plus  loin 
que  cet  enchaînement  ne  paraît  pas  correspondre  à  la  réalité. 

Les  diverses  techniques  d'obtention  ou  de  séparation  des  peptides  à  partir 
du  lysozyme,  telles  qu'elles  ont  été  utilisées  jusqu'ici,  ne  permettaient  pas, 
à  elles  seules,  d'aller  beaucoup  plus  loin  dans  ce  genre  de  travail.  D'autre 
part,  la  difficulté  d'établir  des  enchaînements  avec  certitude  nécessitait  l'utilisa- 
tion de  nouvelles  investigations  dont  les  résultats  devaient  permettre  de  con- 
firmer, ou  éventuellement  d'infirmer  les  conclusions  tirées  jusqu'alors.  Aussi, 
avons-nous  poursuivi  nos  investigations  d'une  part  en  utilisant  des  procédés  de 
scission  de  la  molécule  protéique  aussi  spécifique  que  possible,  et  d'autre  part, 
en  mettant  en  oeuvre,  pour  la  séparation  et  l'analyse  des  peptides  résultant  de 
cette  scission,  l'élégante  méthode  de  Hirs,  Moore  et  Stein.  ^^  Cette  méthode 
permet,  en  principe,  de  recueiller  et  de  séparer  la  totalité  des  peptides  pro- 
venant de  la  scission  d'une  protéine. 

HYDROLYSE    DU    LYSOZYME    PAR    LA    TRYPSINE 

Pour  avoir  un  nombre  de  peptides  raisonnable,  nous  nous  sommes  adressés 
tout  d'abord  à  l'hydrolyse  enzymatique  par  la  trypsine.  La  spécificité  de 
cet  enzyme  est  maintenant  bien  étabhe;  le  fait  qu'il  ne  coupe  que  les  liaisons 
peptidiques  dans  lesquelles  sont  impliquées,  par  leur  groupe  carboxylique, 
l'arginine  et  la  lysine,  faisait  prévoir  que  le  nombre  maximum  de  peptides 
différents  que  l'on  pouvait  obtenir  à  partir  du  lysozyme  était  de  17  ou  18. 
Mais  la  présence,  dans  la  protéine,  d'une  série  de  boucles  dues  à  l'existence 
de  ponts  — S — S —  fait  que  certains  des  peptides  détachés  par  l'action  de  la 
trypsine  doivent  avoir  une  structure  compliquée  et  risquent  de  ne  pas  pouvoir 
être  séparés  convenablement  par  chromatographic.  Aussi  l'action  de  la 
trypsine  a-t-elle  été  appliquée  ici,  non  seulement  sur  le  lysozyme  n'ayant  pas 
subi  de  traitement  autre  qu'une  dénaturation  par  la  chaleur,  mais  aussi  sur  du 
lysozyme  dont  les  ponts  — S — S —  ont  été  rompus  par  oxydation  et  trans- 
formés ainsi  en  un  nombre  double  de  groupes  sulfoniques. 

Action  de  la  trypsine  sur  le  lysozyme  non  préalablement  oxydé.  Plusieurs 
observations^^-^'  ont  montré  que,  comme  d'autres  protéines,  le  lysozyme 
'natif,  c'est-à-dire  obtenu  dans  des  conditions  telles  qu'il  ait  conservé  toute 
son  activité  enzymatique,  est  à  peu  près  complètement  résistant  à  l'action 
de  la  trypsine,  tout  au  moins  dans  les  conditions  habituelles.  Pour  que  la 
protéine  puisse  être  attaquée  par  la  trypsine,  il  convient  donc  de  la  dénaturer, 
ce  que  l'on  obtient  facilement  ici  par  chauffage  d'une  heure  à  100°  en  solu- 
tion aqueuse.  Cette  manière  de  faire  a  l'avantage  de  conserver  intacts  les 
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résidus  de  tryptophane.  On  fait  agir  la  trypsine  sur  le  lysozyme  ainsi  dénaturé, 
pendant  6  heures,  à  37°,  à  pH  7,8,  le  rapport  enzyme/substrat  étant  de 
1/100.  Une  première  séparation  des  peptides  produits  par  la  trypsine  est 
effectuée  sur  colonne  de  Dowex  50  x  2.  Les  détails  de  cette  séparation  et 
les  résultats  qu'elle  a  permis  d'obtenir  sont  indiqués  dans  la  fig.  1.^^ 


1,0 


0,5 


3S'C 


-50°C- 


Vol.  chambre  de  mélange  :     2   t. 


,1  f..,.o.s  e. 
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Fig.  1 .  Séparation  des  produits  obtenus  par  action  de  la  trypsine  sur  25  nM  de  lysozyme 
dénaturé  non  oxydé. 

coloration  de  la  réaction  à  la  ninhydrine  après  hydrolyse  alcaline. 

coloration  de  la  réaction  à  la  ninhydrine  effectuée  directement. 


On  constate  la  présence  de  9  seulement  des  17  ou  18  pics  que  l'on  pouvait 
espérer  obtenir.  La  différence  correspond,  comme  on  le  verra  plus  loin,  aux 
peptides  qui  contiennent  de  la  cystine  et  qui  sont  énergiquement  retenus 
sur  la  colonne.  Après  cette  première  séparation,  les  substances  ont  été 
soumises  à  une  série  de  contrôles  concernant  leur  pureté  et  ont  été,  lorsque 
cela  s'est  révélé  nécessaire,  purifiés  par  diverses  méthodes:  nouvelles  chroma- 
tographies sur  colonne,  et,  après  dessalification,  chromatographies  sur  papier 
et  electrophoreses  sur  papier.  ^^  Il  a  été  ainsi  possible  d'étudier  de  façon 
précise  chacune  des  substances,  ammoniac,  acides  aminés  et  peptides  con- 
stituant les  pics  représentés  par  la  fig.  1.  La  nature  des  acides  aminés  et  la 
composition  des  peptides  est  donnée  dans  le  tableau  4. 

En  plus  des  acides  aminés  libres  (leucine,  lysine,  arginine)  et  de  l'ammoniac, 
quatre  peptides  seulement,  T.6  à  T.9,  ont  été  obtenus  avec  un  rendement 
supérieur  à  10  %.  Leur  structure  a  été  établie^"  en  mettant  en  oeuvre  les 
techniques  habituelles:*  détermination  des  acides  aminés  N-terminaux  par 
la  méthode  de  Sanger  et  la  méthode  d'Edman,  cette  dernière  étant  éventuelle- 
ment utilisée  également  pour  l'établissement  des  enchaînements  N-terminaux, 
hydrolyses  partielles  acides  ou  hydrolyses  chymotrypsiques  fournissant  des 
peptides  plus  courts  étudiés  à  leur  tour  d'une  part  à  l'aide  des  méthodes 

*  Le  détail  des  opérations  ayant  permis  d'établir  les  structures  indiquées  ci-dessous  est 
donné  dans  19. 
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Tableau  4 

COMPOSITION   DES  PICS    SÉPARÉS   APRÈS   HYDROLYSE 

TRYPSIQUE    DU    LYSOZYME    NON   PRÉALABLEMENT 

OXYDÉ  18 

Les  rendements  de  chaque  peptide  sont  indiqués  entre  parenthèses 


T.l  Leucine  Hbre  (50  %)  (correspond  à  leucine  terminale). 

T.2  Peptide  à  très  faible  rendement,  contenant  Try,  etc. 

T.3  NH3 

T.4  Lysine  hbre  (92  %)  (correspond  à  lysine  initiale). 

T.5  Arginine  libre  (40  %). 

T.6  (Asp(NH2),  Arg)  (45  %). 

T.7  (Gly,  Ser,  Pro,  Thr,  Arg)  (36  %). 

T.8  (Gly,  Ala,  Ser,  Leu,  Tyro,  Aspa,  Glu,  Lys,  Arg,  His)  (17  %). 

T.9  (Ala,  Ser,  Thr,  Phcg,  Aspa,  Glug,  Arg)  (25  %). 


précédentes,  et  d'autre  part  au  moyen  de  la  carboxypeptidase,  etc.  Les 
résultats  trouvés,  combinés  à  ceux  qui  avaient  été  précédemment  obtenus 
dans  ce  laboratoire  (tableau  1)  ont  permis  d'attribuer  à  ces  peptides  les 
structures  indiquées  dans  la  tableau  5. 

Tableau  5 

STRUCTURE    DES    PEPTIDES    ISOLÉS    APRÈS    HYDROLYSE 
TRYPSIQUE    DU    LYSOZYME    NON    OXYDÉ20 


T.6  Asp(NH2).Arg 

T.7  Thr.Pro.Gly.Ser.Arg 

T.8  His.Lys.Glu.*  Asp  *.  Ala.Tyr.Gly.Ser.Leu.  Asp.  Asp(NH2).Tyr.  Arg 

T.9  Phe.Glu.*Ser.Phe.Asp.Glu.*Ala.Thr.Asp.*Arg 


Les  données  des  tableaux  4  et  5  montrent  que  les  peptides,  isolés  après 
action  de  la  trypsine  sur  le  lysozyme  simplement  dénaturé  par  la  chaleur, 
présentent  tous  les  quatre  l'arginine  comme  acide  aminé  C-terminal.  Aucun 
ne  contient  de  cystine;  un  seul  renferme  de  la  lysine.  On  peut  donc  s'attendre 
à  ce  que  ce  soit  les  peptides,  qui,  renfermant  de  la  cystine,  sont  retenus  dans 
les  conditions  actuelles  sur  la  colonne,  et  qui  possèdent  la  lysine  comme 
acide  aminé  C-terminal.  Cette  observation  montre  l'intérêt  d'oxyder  le  lyso- 
zyme avant  de  le  soumettre  à  l'action  de  la  trypsine  pour  pouvir  obtenir  la 
totalité  des  peptides  libérés  par  hydrolyse  trypsique. 

Action  de  la  trypsine  sur  le  lysozyme  oxydé.  La  transformation  des  ponts 
— S — S —  du  lysozyme  en  groupes  sulfoniques  a  été  faite  ici  au  moyen  de 
l'acide  performique,  dans  les  conditions  particulièrement  douces  décrites  par 
*  Il  n'a  pas  été  déterminé  si  ce  résidu  est  amidé  ou  non. 
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du  Vigneaud  et  col.'^^  On  doit  toutefois  remarquer  qu'il  est  pratiquement 
impossible  d'éviter  lors  de  l'oxydation  en  question  la  destruction  de  la 
majeure  partie  du  tryptophane,  qui  se  trouve  essentiellement  transformé  en 
kynurénine^^;  il  semble  toutefois  que,  au  moins  dans  la  majorité  des  cas,  la 
transformation  subie  par  les  résidus  de  tryptophane  n'entraîne  pas  la  rupture 
des  liaisons  peptidiques  auxquelles  ils  prennent  part.  D'autre  part,  il  semble 
également  que,  dans  les  conditions  actuelles,  la  destruction  des  résidus  de 
tyrosine  soit  à  peu  près  complètement  évitée.  Néanmoins,  il  convient  d'inter- 
préter avec  prudence  les  données  obtenues  à  partir  du  lysozyme  ainsi  oxydé. 
Le  lysozyme  oxydé  a  été  soumis  à  l'action  de  la  trypsine  dans  les  mêmes 
conditions  que  le  lysozyme  non  oxydé.  Les  produits  résultant  de  cette  action 
ont  été  séparés  par  chromatographic  sur  colonne  de  Dowex  50  X  2.  Les 
conditions  de  cette  séparation  et  les  résultats  qu'elle  a  fournis  sont  indiqués 
dans  la  fig.  2.^^ 
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Fig.  2.  Séparation  des  produits  obtenus  par  action  de  la  trypsine  sur  25  /xM  de  lysozyme 
oxydé. 

coloration  de  la  réaction  à  la  ninhydrine  après  hydrolyse  alcaline. 

coloration  de  la  réaction  à  la  ninhydrine  effectuée  directement. 


On  constate  ici  la  présence  de  17  pics;  après  cette  première  séparation, 
les  substances  correspondant  à  chacun  des  pics  ont  été,  comme  dans  le  cas 
du  lysozyme  non  oxydé,  soumises  à  divers  contrôles  concernant  leur  homo- 
généité et  ont  été  purifiées  par  les  méthodes  habituelles.  Il  a  été  ainsi  possible 
de  caractériser  de  façon  précise  chacune  de  ces  substances:  la  nature  des 
acides  aminés  et  la  composition  des  peptides  sont  données  dans  le  tableau  6. 

En  plus  des  acides  aminés  libres  (leucine,  lysine,  arginine)  et  de  l'ammoniac, 
12  peptides  ont  été  obtenus  avec  un  rendement  suffisant  pour  permettre 
l'étude  de  leurs  structures.  Ces  structures  ont  été  établies  pour  9  d'entre 
eux.  Les  résultats  trouvés,  combinés  à  ceux  précédemment  obtenus  (tableau  1) 
ont  permis  d'attribuer  à  ces  peptides  les  constitutions  indiquées  dans  le 
tableau  7. 
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Tableau  6 

COMPOSITION    DES    PICS    SÉPARÉS    APRÈS   HYDROLYSE 
TRYPSIQUE   DU   LYSOZYME  PRÉALABLEMENT   OXYDÉi» 

Les  rendements  pour  chaque  peptide  sont  indiqués  entre  parenthèses 


TO.l     (Alao,  Sero,  CySOgH,  Thri.g,  Pro,  Val,  Leu(?),  lieu,  Aspa-s,  Lys). 

T0.2    même  composition  que  TO.l  (plus  amidé)  (6  %). 

„„  -  r  Leucine  libre  (15  %)  (correspond  à  leucine  terminale)  (Cf  T.l), 

^"•"^{(Alaa,  CySOgH,  (MetOo),  Leu,  Glu,  Lys)  (35  %). 

T0.4    non  étudié  (5  %). 

T0.5    (Glya,  Serj,  Thri_2,  Leu,  Ueuo,  Tyr,  Aspa-a,  Glu). 

T0.6    (CySOgH,  Lys)  (35  %). 

T0.7    NHg  (Cf.  T.3). 

T0.8    Lysine  libre  (78  %)  (correspond  à  lysine  initiale)  (Cf.  T.4). 

T0.9    (Gly,  CySOgH,  Leu,  Arg)  (58  %)  (peptide  C-terminal). 

TO.IO    (Alag,  CySOgH,  (MetOg),  Leu,  Glu,  Lys,  Arg)  (17  %). 

TO.  11     (Glys,  Sero,  Thri-2,  Leu,  Ileug,  Tyr,  Asp4,  Glu,  Arg)  (7  %). 

TO.  12    (Gly,  Val,  Phe,  Arg)  (16  %). 

TO.  13    non  étudié. 

xn  14/  Arginine  libre  (Cf.  T.5), 

^^•'^\(Asp(NH2),Arg)(79  %). 

TO.  15    (Gly,  Ser,  Thr,  Pro,  Arg)  (60  %). 

TO.  16    (Gly,  Ala,  Ser,  Leu,  Tyro,  Aspg,  Glu,  Lys,  Arg,  His)  (19  %). 

TO.  17    (Ala,  Ser,  Thr,  Phco,  Aspa,  Glua,  Arg)  (25  %). 


Tableau  7 

STRUCTURE    DES    PEPTIDES    ISOLÉS    APRÈS    HYDROLYSE 
TRYPSIQUE   DU    LYSOZYME    OXYDÉ^» 


T0.3  CySOgH.Glu.+Ala.Leu.Ala.Ala.MetOg.Lys 

T0.6  CySOgH.Lys 

T0.9  Gly.CySO3H.Arg.Leu 

TO.IO  CyS0gH.Glu*.Ala.Leu.Ala.Ala.Met02.Lys.Arg 

TO.  12  Val.Phe.Gly.Arg 

T0.14  Asp(NH2).Arg  (Cf.  T.6). 

TO.  15  Thr.Pro.Gly.Ser.Arg  (Cf.  T.7). 

T0.16  His.Lys.Glu*.Asp*.Ala.Tyr.Gly.Ser.Leu.Asp.Asp  (NH2). 

Tyr.Arg.  (Cf.  T.8). 

T0.17  Phe.Glu*.Ser.Phe.Asp.Glu*.Ala.Thr.Asp*.Arg  (Cf.  T.9). 


Les  données  du  tableau  7  montrent  tout  d'abord  que  les  peptides  TO.  14, 
TO.  15,  TO.  16  et  TO.  17  isolés  après  action  de  la  trypsine  sur  le  lysozyme 
oxydé  sont  respectivement  les  mêmes  que  les  peptides  T.6,  T.7,  T.8  et  T.9 
résultant  de  l'hydrolyse  trypsique  du  lysozyme  non  oxydé.  On  voit  également 
que,  parmi  les  peptides  nouveaux  obtenus,  quatre  correspondent  à  des 
enchaînements  du  lysozyme  contenant  de  la  cystine  et  deux  possèdent  la 
*  Il  n'a  pas  été  déterminé  si  ce  résidu  est  amidé  ou  non. 
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lysine  comme  acide  aminé  C-terminal;  on  constate  d'autre  part  que  le  peptide 
TO.  10  correspond  au  même  enchaînement  que  le  peptide  TO. 3  avec  seule- 
ment un  résidu  d'arginine  supplémentaire,  en  position  C-terminale.  Enfin, 
le  peptide  T0.9  est  particulièrement  intéressant:  il  se  différencie  de  tous  les 
autres  peptides  obtenus  par  hydrolyse  trypsique  et  dont  la  structure  a  été 
établie,  du  fait  que  l'acide  aminé  qu'il  possède  en  position  C-terminale  n'est 
ni  la  lysine,  ni  l'arginine,  mais  la  leucine.  On  verra  plus  loin  que  le  peptide 
T0.9  correspond  à  l'enchaînement  C-terminal  du  lysozyme. 

REMARQUES 

1.  Le  peptide  T0.5  mérite  une  mention  particuhère;  il  ne  contient  en  effet 
aucun  résidu  d'acide  aminé  basique;  sa  production  correspond  donc  à  une 
coupure  aberrante,  peut-être  due  à  la  présence  d'un  enzyme  protéolytique 
étranger  dans  l'échantillon  de  trypsine  utilisé.  De  toute  façon,  le  rendement 
de  ce  peptide  est  très  faible. 

2.  La  liste  des  peptides  fournis  par  les  tableaux  6  et  7  ne  rend  pas  compte 
de  l'ensemble  des  résidus  d'acides  aminés  du  lysozyme;  il  manque  notam- 
ment les  fragments  de  la  protéine  qui  contiennent  les  résidus  de  tryptophane. 
Il  est  intéressent  à  ce  sujet  de  noter  qu'il  a  été  possible  d'isoler  sur  papier, 
après  action  de  la  trypsine  sur  le  lysozyme  non  oxydé,  un  peptide  dont  la 
structure  est  Val.Ala.Try.Arg.  Ce  peptide  ne  se  retrouve  pas  après  chromato- 
graphic de  l'hydrolysat  trypsique  sur  colonne  de  Dowex  50  X  2;  comme 
probablement  les  autres  peptides  renfermant  du  tryptophane,  il  reste  forte- 
ment adsorbé  sur  la  résine  et  ne  pourrait  être  élue  qu'en  milieu  très  alcalin. 

HYDROLYSE    DU    LYSOZYME    PAR    LA    PEPSINE 

Il  paraît  utile  de  donner  ici  les  résultats  d'une  première  étude  des  produits 
résultant  de  l'action  de  la  pepsine  sur  le  lysozyme.  ^^  Comme  dans  le  cas  de 
l'hydrolyse  trypsique,  l'hydrolyse  par  la  pepsine  a  porté  d'une  part  sur  le 
lysozyme  non  oxydé,  ici  le  lysozyme  natif,  et  d'autre  part  sur  le  lysozyme 
préalablement  oxydé  par  l'acide  performique.  Dans  les  deux  cas,  la  pepsine 
exerce  son  action  pendant  24  heures,  à  37°,  à  pH  2,2,  le  rapport  enzyme/sub- 
strat étant  de  1/100.  Les  premières  séparations  des  produits  formés  sont 
effectuées  par  chromatographic  sur  colonne  de  Dowex  50  x  2,  comme 
précédemment.  Les  détails  de  ces  séparations  et  leur  résultat  sont  indiqués 
dans  la  fig.  3  pour  le  lysozyme  natif  et  dans  la  fig.  4  pour  le  lysozyme  oxydé. 
Les  fractions  correspondant  à  un  pic  donné  sont  réunies,  dessahfiées  puis 
soumises  aux  opérations  habituelles  de  purification.  On  constate  ainsi  que 
la  plupart  des  pics  renferment  deux  et  quelques-uns  même  trois  peptides 
différents.  La  structure  de  quelques  uns  seulement  des  peptides  ainsi  obtenus  a 
été  jusqu'ici  établie  : 

PA.  6  ou  PO.  6  Val.Glu(NH2).Ala       PA.  16  b  Gly.Tyr. lieu. Leu 

PA.  7  ou  PO.  7       Ileu.Thr.Ala  PA.  17  ou  PO.  17     Ala.Lys.Phe 

PA.  12  ou  PO.  12  Gly.Ileu.Leu  PA.  18c  ou  PO.  18c  Tyr.Arg.Gly 
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Fig.  3.  Séparation  des  produits  obtenus  par  action  de  la  pepsine  sur  20  /xM  de  lysozyme 
non  oxydé. 

coloration  de  la  réaction  à  la  ninhydrine  après  hydrolyse  alcaline. 

coloration  de  la  réaction  à  la  ninhydrine  effectuée  directement. 
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Fig.  4.  Séparation  des  produits  obtenus  par  action  de  la  pepsine  sur  20  ^M  de  lysozyme 
oxydé. 

coloration  de  la  réaction  à  la  ninhydrine  après  hydrolyse  alcaline. 

coloration  de  la  réaction  à  la  ninhydrine  effectuée  directement. 


ETAT   DES    CONNAISSANCES    ACQUISES    DANS    LE 

PRÉSENT    TRAVAIL    SUR    LA    STRUCTURE    DU 

LYSOZYME 

Enchaînement  N-terminal  et  enchaînement  C-termînal.  L'enchaînement  N- 
terminal  a  été,  il  y  a  déjà  quelques  années^,  démontré  être:  H.Lys.Val.Phe. 
Gly.Arg.  Il  est  donc  inutile  d'y  revenir  ici.  L'enchaînement  C-terminal  a  fait 
récemment  l'objet  d'une  étude  détaillée. ^^  De  nombreux  arguments  mon- 
trent qu'il  est  constitué  par  Gly.Cys.Arg.Leu.OH.  Les  principaux  de  ces 
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arguments  sont  les  suivants:  1)  la  leucine  C-terminale  dans  le  lysozyme  est 
immédiatement  précédée  d'un  résidu  d'acide  aminé  basique,  lysine  ou  argi- 
nine,  étant  donné  la  spécificité  de  la  trypsine:  2)  le  peptide  T0.9,  obtenu 
avec  un  rendement  de  60  %,  est  le  seul  peptide  qui  possède  la  leucine  comme 
acide  aminé  C-terminal,  cette  leucine  étant  immédiatement  précédée  par 
l'arginine:  3)  l'action  de  la  carboxypeptidase  sur  le  DNP-lysozyme  conduit, 
après  un  temps  suffisamment  long,  à  la  libération  d'une  quantité  faible  mais 
nette  d'arginine,  alors  que  la  leucine  terminale  a  déjà  été  détachée. 

Enchaînements  intérieurs.  Tenant  compte  de  ce  que  la  molécule  de  lysozyme 
renferme  seulement  trois  résidus  de  tyrosine,  trois  résidus  de  phenylalanine, 
un  résidu  d'histidine,  et  utilisant  les  peptides  caractérisés  antérieurement 
dans  ce  laboratoire  (Tableaux  1  et  2),  il  est  possible  de  représenter  l'état 
actuel  de  nos  connaissances  sur  la  structure  du  lysozyme  comme  il  est 
indiqué  dans  le  tableau  8. 

Tableau  8 

ENSEMBLE   DES    RÉSULTATS    OBTENUS    DANS 

LE    PRÉSENT    TRAVAIL    SUR    LA    STRUCTURE    DU 

LYSOZYME 


H.Lys.Val.Phe.Gly.Arg. 

[.Arg.His.Lys.Glu*.Asp(NH2).Ala.Tyr.Gly.Ser.Leu.Asp.Asp(NH2). 
Tyr.Arg.Gly.],[.Ala.Lys.Phe.Glu*.Ser.Phe.Asp.GIu*.Ala.Thr.Asp*. 
Arg.],[.Cys.Glu*.Ala.Leu.Ala.Ala.Met.Lys.Arg.],[Thr.Pro.Gly. 
Ser.Arg.],[.Val.Ala.Try.Arg.],[.Asp.Ala.Try],[Ileu.Thr.Ala.],[.Gly.Ileu.Leu.], 
[.Val.Glu(NH2).AIa.],[.Asp(NH2).Arg.],[.Leu.Pro.],[.Cys.Lys.], 
[.Arg.GIy.Tyr.Ileu.Leu.(Gly,Ser2,Thr,Ileu,Asp4*,Glu*).Arg.], 

Gly.Cys.Arg.Leu.OH. 


Les  données  du  tableau  8  appellent  les  commentaires  suivants:  1)  L'exis- 
tence du  peptide  PO.  17  (ou  FA,  17)  Ala.Lys.Phe  prouve  que  l'enchaînement 
Ala. Lys  doit  précéder  immédiatement  le  peptide  TO.  17  (ou  T.9);  en  effet 
les  places  des  trois  résidus  de  phenylalanine  sont  connues  et  le  peptide 
TO.  17  (ou  T.9)  est  le  seul  présentant  la  phenylalanine  comme  résidu  N- 
terminal.  2)  L'existence  du  peptide  PO.  18  Tyr.Arg.Gly  permet  d'allonger 
d'un  résidu  d'acide  aminé  le  peptide  TO.  16  (ou  T.8)  du  côté  C-terminal; 
le  peptide  TO.  16  (ou  T.8)  est  en  effet  le  seul  qui,  obtenu  par  hydrolyse  tryp- 
sique,  possède  l'enchaînement  Tyr.Arg  comme  enchaînement  C-terminal,  et 
les  places  des  deux  autres  résidus  de  tyrosine  sont  connues.  3)  La  formation  par 
hydrolyse  trypsique  du  peptide  TO.  1 6  (ou  T.8),  présentant  un  résidu  d'histidine 
comme  résidu  N-terminal,  implique  que  dans  le  lysozyme  ce  résidu  soit  lié 
à  un  résidu  de  lysine  ou  d'arginine.  L'isolement  du  peptide  Arg.His.Lys 
*  Il  n'a  pas  été  déterminé  si  ces  résidus  sont  amides  ou  non. 
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(1.  tableau  1)  montre  que  ce  résidu  est  l'arginine.  4)  Les  enchaînements 
Ala.Lys.Phe.Glu  (7.  tableau  1)  et  Glu.Ser.Phe.Asp  (6.  tableau  1)  font  partie 
d'un  enchaînement  plus  important  auquel  participe  également  le  peptide 
TO.  17  (ou  T.9).  5)  Les  enchaînements  2  et  4  du  tableau  1,  tous  deux  marqués 
par  la  tyrosine,  se  rencontrent  l'un  à  la  suite  de  l'autre  dans  le  peptide 
TO.  16  (ou  T.8).  Il  convient  toutefois  de  remarquer  qu'une  correction  est 
ici  nécessaire:  le  résidu  asparagine  C-terminal  dans  l'enchaînement  4  du 
tableau  1  n'est  pas  précédé  immédiatement  d'un  résidu  de  leucine,  mais  en 
est  séparé  par  un  résidu  d'acide  aspartique.  D'autre  part,  deux  résidus  de 
tyrosine  se  trouvant  ainsi  localisés  dans  le  peptide  TO.  16  (ou  T.8),  le  seul 
résidu  de  tyrosine  restant  disponible  est  celui  qui  se  trouve  dans  le  peptide 
TO.ll.  Il  doit  donc  faire  partie  de  l'enchaînement  3  du  tableau  1.  On  est 
donc  conduit  à  penser  que  le  peptide  TO.ll  fait  partie  d'un  ensemble  dont 
la  structure  partiellement  connue  est  celle  indiquée  dans  le  tableau  8.  6)  Le 
peptide  TO.  10  correspondant  à  l'enchaînement  Cys.Glu.Ala.Leu.Ala.Ala. 
Met.Lys.Arg  est  en  bon  accord  avec  le  tripeptide  Ala.Ala.Met  (10.  tableau  1), 
mais  il  est  en  contradiction  avec  l'enchaînement  Asp.Ala.Met.Lys.Cys.Arg 
proposé  par  Thompson  (8.  tableau  3).  D'autres  contradictions  se  présentent 
entre  les  résultats  de  cet  auteur  et  ceux  du  présent  travail:  l'enchaînement 
Ala.Lys.Phe.Glu.Ser.Phe.Asp  (7  et  6.  tableau  1)  n'est  pas  compatible  avec 
l'enchaînement  Lys.Phe.Glu.Gly  (11.  tableau  3)  et  l'enchaînement  Thr.Pro. 
Gly.Ser.Arg  (T.7)  n'est  pas  compatible  non  plus  avec  l'enchaînement 
Val.Thr.Pro.Gly.Ala  (9.  tableau  3)  de  l'auteur  australien.  Dans  ce  dernier 
cas,  la  vadilité  de  l'enchaînement  proposé  ici  est  confirmée  par  les  obser- 
vations suivantes:  le  lysozyme  ne  contient  que  deux  résidus  de  proline,  or 
l'un  est  engagé  dans  l'enchaînement  Leu.Pro  (8.  tableau  1)  ou  Ileu.Pro 
(19.  Tableau  3),  l'autre  dans  l'enchaînement  Thr.Pro. Gly  trouvé  également 
par  Thompson.  Du  côté  N-terminal,  la  spécificité  de  l'action  de  la  trypsine 
suggère  que  Thr.  doit  être  précédé  d'un  résidu  d'acide  aminé  basique.  Du 
côté  C-terminal,  la  présence  d'un  résidu  d'alanine  lié  à  un  résidu  de  glycocolle 
n'a  jamais  pu  être  constatée.  7)  Il  convient  enfin  de  signaler  ici  l'incompati- 
bilité totale  entre  l'enchaînement  C-terminal  proposé  par  Ohno  et  celui  qui 
fait  l'objet  du  présent  exposé. 

Pour  expliquer  partiellement  les  contradictions  qui  viennent  d'être  signalées, 
on  peut  remarquer  que  les  peptides  étudiés  ici  ont  été  obtenus  essentiellement 
par  voie  enzymatique;  ils  sont  donc  relativement  longs  et  peu  nombreux 
et  assez  faciles  à  purifier,  alors  que  Thompson  a  obtenu,  après  hydrolyse 
acide  partielle,  un  mélange  complexe  de  di-  et  de  tripeptides  correspondant 
à  des  enchaînements  partiels  dont  la  position  à  l'intérieur  de  la  molécule 
de  lysozyme  est  très  difîicile  à  fixer  avec  certitude. 

REMARQUES    SUR    LA    SPÉCIFICITÉ    DE    LA    TRYPSINE 

L'action  de  la  trypsine  a  permis,  comme  on  vient  de  le  voir,  d'isoler  un 
certain  nombre  de  peptides  dont  la  structure  a  pu  être  déterminée.  En  outre 
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Tctude  de  cette  action  a  permis  de  préciser  certaines  particularités  concernant 
la  spécificité  de  l'enzyme.  On  constate  ainsi  que  la  trypsine  n'a  pas  scinde 
la  liaison  Lys.Glu  dans  l'enchaînement  du  peptide  TO.  16  (ou  T.8).  On  peut 
attribuer  la  résistance  de  cette  liaison  à  la  présence  du  groupement  carboxy- 
lique  libre  du  résidu  glutamique;  on  retrouve  d'ailleurs  la  même  résistance 
de  la  liaison  Lys.Glu  vis-à-vis  de  la  trypsine  dans  le  cas  de  la  ribonucléase.^^ 
Il  est  également  intéressant  de  noter  que  l'enchaînement  Arg.His.Lys  (I. 
tableau  1)  a  été  scindé  par  la  trypsine  uniquement  entre  Arg  et  His.  Il  n'a 
été  trouvé  ni  histidine  libre  ni  peptide  His. Lys.  On  pourrait  toutefois  penser 
que  si  le  résidu  lysine  dont  il  s'agit  ici  n'avait  pas  été  engagé  dans  la  liaison 
Lys.Glu  difficile  à  rompre,  une  certaine  quantité  du  peptide  His. Lys  eut 
été  trouvée.  Enfin,  il  convient  de  souligner  les  différences  dans  les  rende- 
ments de  leucine  libre  (leucine  C-terminale)  provenant  du  peptide  T0.9: 
le  rendement  obtenu  après  hydrolyse  du  lysozyme  simplement  dénaturé  est 
de  50  %;  ce  rendement  tombe  à  15  %  dans  le  cas  du  lysozyme  oxydé.  Ce 
fait  peut  être  expliqué  par  la  présence,  à  côté  de  l'arginine,  du  groupement 
sulfonique  du  résidu  acide  cystéique,  qui  par  sa  forte  acidité  doit  inhiber 
l'action  de  la  trypsine. 
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It  has  been  recognized  for  about  thirty  years  that  proteins  may  possess 
intrinsic  biological  activity  as  enzymes,  hormones,  antigens  and  antibodies, 
and  Northrop  (1932)  was  perhaps  the  first  to  formulate  and  try  to  answer 
the  next  question  :  Is  the  whole  protein  molecule  essential  for  these  activities 
or  is  it  a  property  of  only  part  of  the  molecule?  Northrop  attempted  to 
hydrolyse  crystalline  pepsin  without  loss  of  activity  but  failed.  Success  was 
soon  achieved  with  antibodies  (horse  anti-diphtheria  toxin  serum)  by 
Parventjev  (1936),  Petermann  and  Pappenheimer  (1941),  and  Northrop  (1942), 
the  last  two  isolating  approximately  half  molecules  which  would  still  com- 
bine with  the  antigen.  Later  a  fifth  part  of  rabbit  anti  ovalbumin  antibody 
was  obtained,  by  papain  digestion,  which  retained  the  specific  combining 
power  (Porter,  1950). 

Recently  the  partial  degradation  of  several  enzymes  without  loss  of 
activity  has  been  reported,  e.g.  pepsin  (Perlman,  1954),  papain  (Hill  and 
Smith,  1956)  and  ribonuclease  (Richards,  1955;  Kalnitsky  and  Rogers,  1956; 
Rogers  and  Kalnitsky,  1957).  The  degradation,  in  several  cases,  appeared 
to  be  very  considerable,  although  the  isolation  of  the  fragment  and  proof 
that  the  active  fragments  were  as  small  as  other  evidence  appeared  to  suggest 
has  not  yet  been  described.  Some  terminal  amino  acids  may  be  removed 
from  the  hormones  insulin  (Harris  and  Li,  1952),  adrenocorticotrophin  and 
growth  hormone  (Harris,  Li,  Condliffe  and  Pon,  1954)  and  also  from  tobacco 
mosaic  virus  (Harris  and  Knight,  1952)  without  loss  of  activity. 

The  combining  centre  of  a  protein  antigen  might  appear  to  offer  the 
easiest  opportunity  of  isolating  a  specifically  active  fragment  of  protein, 
as  Landsteiner's  work  with  artificially  conjugated  antigens  showed  that  the 
specificity  is  directed,  in  these  compounds,  to  the  small  attached  groups — 
the  haptenes.  This  he  showed  by  using  the  haptene  to  specifically  inhibit 
the  combination  of  the  conjugated  antigen  with  its  antiserum.  However,  a 
very  considerable  excess  of  haptene  is  required  to  achieve  inhibition  (a  molar 
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ratio  of  haptene  to  antigen  of  more  than  1000/1)  suggesting  that  the  haptene 
may  be  only  a  small  part  of  the  actual  combining  site.  The  only  attempt  to 
isolate  an  antigenic  combining  site  from  a  protein  was  carried  out  by  Land- 
steiner  (1942).  He  used  silk  fibroin  as  antigen  rendered  soluble  by  treatment 
with  acid  and  increased  its  antigenic  power  by  adsorption  onto  charcoal 
before  injecting  it  into  rabbits.  The  silk  fibroin  was  then  degraded  by  strong 
sulphuric  acid  and  the  digest  was  tested  for  its  ability  to  inhibit  the  com- 
bination of  antigen  and  antiserum.  Evidence  was  obtained  showing  that 
apparently  small  peptides  would  inhibit  this  combination  and  that  they 
must  retain  therefore  some  of  the  features  of  the  combining  sites  of  the 
original  antigen.  With  the  methods  available  at  the  time  it  was  not  possible 
to  characterize  the  active  fragments  further  and  as  the  original  antigen  was 
partially  degraded  fibroin  protein  it  was  not  clear  what  significance  these 
results  bore  relative  to  the  nature  of  the  combining  centres  of  native  globular 
protein  antigens.  Other  work  has  suggested  that  native  globular  proteins 
will  retain  the  power  to  combine  with  specific  antisera  after  at  least  slight 
enzymic  hydrolysis  (Landsteiner  and  Chase,  1933;  Holiday,  1939;  Klecz- 
kowski,  1945;  Lapresle,  1955a,  b). 

We  have  therefore  investigated  the  problem  further  and  have  found  thats 
if  crystalline  bovine  serum  albumin  is  hydrolysed  by  chymotrypsin  in  a 
dialysis  sack  suspended  in  water  brought  to  pH  8  with  NH3  at  25°,  then 
material  appears  in  the  outer  liquor  (the  diff"usate)  which  will  specifically 
inhibit  the  combination  of  bovine  serum  albumin  with  its  antiserum.  Full 
details  of  this  work  have  been  given  elsewhere  (Porter,  1957). 

It  was  found  that  the  active  material  (subsequently  referred  to  as  the 
inhibitor)  would  not  pass  through  a  diff'erent  sample  of  dialysis  tubing  and 
hence  after  concentration  in  a  rotary  still  it  was  redialysed  in  this  second 
sack  with  a  smaller  pore  size.  About  half  the  weight  was  lost  without  loss 
of  activity.  It  was  subsequently  purified  by  ethanol  precipitation  and  zone 
electrophoresis  and  finally  by  precipitation  with  5%  trichloroacetic  acid. 
The  precipitate  redissolved  in  water  and  there  was  no  loss  of  immunological 
activity,  but  small  adsorbed  peptides  were  removed. 

The  material  now  appeared  to  be  pure.  Electrophoresis  and  ultracentri- 
fugation  showed  only  one  component  and  end  group  assay  showed  the 
presence  of  only  A^-terminal  amino  acid — phenylalanine — present  as  1  mole 
per  14,000. 

Estimation  of  molecular  weight  by  sedimentation  gave  a  value  of  about 
24,000 — much  higher  than  had  been  expected — and  it  was  found  that  the 
purified  inhibitor  would  no  longer  pass  through  the  large  pore  size  dialysis 
tubing  as  it  had  done  in  the  first  stage  of  preparation.  It  was  eventually  shown 
that  dimerization  had  occurred  during  purification  and  that  this  could  be 
reversed  by  cysteine  at  pH  4.  Under  these  conditions  the  sedimentation  co- 
efficient corrected  to  water  at  20°  fell  from  2-13  to  1-42  and  the  calculated 
molecular  weight  from  24,000  to  12,000.  The  latter  figure  is  close  to  the 
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minimum  molecular  weight  found  by  end  group  assay  of  14,000,  The  mono- 
mer readily  dimerizes  on  being  brought  to  neutral  or  slightly  alkaline  reac- 
tion in  the  absence  of  a  reducing  agent.  The  dimerization  of  human  and 
bovine  serum  albumin  on  addition  of  mercuric  ion  or  a  divalent  organic 
mercurial  has  been  extensively  studied  at  Harvard  since  the  original  obser- 
vation of  the  phenomenon  by  Hughes  (1950).  It  was  shown  that  70-80% 
of  the  molecules  possess  a  single  SH  group  which  is  apparently  unable  to 
interact  with  another  SH  group  for  steric  reasons  but  which  will  cause 
dimerization  in  the  presence  of  Hg,  It  seems  probable  that  the  12,000  frag- 
ment which  we  have  isolated  possesses  this  single  sulphydryl  group  but 
owing  to  the  removal  of  four-fifths  of  the  original  molecule  the  steric  hind- 
rance has  been  eliminated  and  these  fragments  can  now  dimerize  freely. 
Amino  acid  analysis  shows  big  differences  (Table  1)  between  the  inhibitor 

Table  1 

AMINO  ACID   ANALYSIS   OF  INHIBITOR 
AND    OF    SERUM    ALBUMIN 

Results  are  expressed  as  g,/100  g.  of  protein 


Amino  acid 

Inhibitor 

Serum  albumin* 

Aspartic  acid 

90 

10-9 

Threonine 

7-2 

5-8 

Serine 

20 

4-2 

Glutamic  acid 

16-3 

16-5 

Proline 

7-3 

4-8 

Glycine 

1-9 

1-8 

Cystine 

2-7 

60 

Alanine 

5-6 

6-3 

Valine 

9-3 

5-9 

Methionine 

0-8 

0-8 

Leucine 

11-6 

12-3 

Isoleucine 

2-8 

2-6 

Tyrosine 

1-2 

5-1 

Plienylalanine 

61 

6-6 

Tryptophan 

Of 

0-58 

Lysine 

12-9 

12-8 

Histidine 

21 

4-0 

Arginine 

5-4 

5-9 

and  the  whole  molecule  and  it  may  be  noted  that  there  are  1  -4  molecules 

of  cystine  present  suggesting  that  there  is  one  disulphide  bond  and  a  free 

SH  group. 
The  immunological  activity  of  the  fragment  is  lost  if  it  is  dissolved  in 

■    *  The  amino  acid  content  of  serum  albumin  has  been  taken  from  Stein  and  Moore 
(1949). 

t  Tryptophan  could  not  be  detected  spectroscopically  with  the  Holiday  photographic 
method  (Beaven  and  Holiday,  1952), 
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Fig.  1.  Comparison  of  behaviour  of  bovine-serum  albumin  (BSA)  and  inhibitor  (I) 
on  diffusion  into  antiserum  in  agar  gel  by  the  method  of  Ochterlony  (1953).  Concen- 
trations, 5  0-5  and  005  mg./ml.  with  both  reagents.  Centre  cup  contains  rabbit 
anti-bovine  serum  albumin.  Photograph  was  taken  after  the  plate  had  been  standing 

for  5  days  at  2°. 
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a  strongly  reducing  medium  such  as  neutral  m  sodium  thioglycollate — the 
solution  turning  into  a  gel  on  standing  at  room  temperature.  Apparently 
when  the  internal  disulphide  bond  is  broken,  the  configuration  of  the  mole- 
cule is  disrupted  with  loss  of  solubility  and  biological  activity.  Further 
evidence  of  the  importance  of  configuration  was  obtained.  If  a  solution  of 
inhibitor  is  heated  to  100°  for  2  min.  a  visible  precipitate  forms  and  no 
activity  is  left.  If  the  inhibitor  is  dissolved  in  3m  urea  at  room  temperature 
then  the  activity  is  lost  slowly — it  is  reduced  to  about  half  in  24  hours.  It 
seems  clear  that  the  combining  site  of  the  fragment  is  a  relatively  large 
section  of  the  peptide  chain,  and  its  abihty  to  combine  with  antibody  is 
dependant  on  the  integrity  of  the  native  configuration.  This  is  lost  if  the 
disulphide  bond  is  broken  or  if  more  labile  bonds  are  broken  by  heat  or 
urea.  At  the  same  time  the  inhibitor  is  stable  to  ethanol  at  room  temperature 
and  also  to  pH  2  and  pH  10-5 — the  latter  observation  suggesting  that  electro- 
static bonds  are  not  important  in  preserving  the  native  configuration. 

When  the  immunological  properties  of  this  fragment  were  investigated  it 
became  clear  that  only  some,  perhaps  only  one,  of  the  combining  sites  of 
the  original  antigen  were  present  in  the  inhibitor.  For  example,  with  anti- 
serum from  rabbits  after  a  short  course  of  injections  with  bovine  serum 
albumin,  the  inhibitor  would  combine  and  subsequently  prevent  the  preci- 
pitation with  serum  albumin  of  about  one-third  of  the  antibody  present. 
This  power  to  combine  with  only  part  of  the  antibodies  could  be  demon- 
strated clearly  when  the  behaviour  of  whole  antigen  and  inhibitor  were 
compared  by  precipitation  in  agar  gel  according  to  the  technique  of  Ochter- 
lony.  In  Fig.  1  it  can  be  seen  that  a  typical  Y  type  of  interaction  occurs 
showing  that  the  inhibitor  is  combining  with  part  of  the  antibody,  the 
remainder  diffusing  through  until  it  meets  the  serum  albumin  front.  This 
presumably  means  that  antibodies  are  present  in  the  serum  which  are  directed 
to  antigenic  binding  sites  situated  in  the  four-fifths  of  the  molecule  which 
has  been  lost. 

It  was  interesting  to  observe  that  in  serum,  taken  from  animals  which 
had  been  given  a  prolonged  series  of  injections  of  serum  albumin,  the  in- 
hibitor could  combine  with  a  much  larger  proportion  of  the  antibodies 
present  and,  in  some  cases,  with  all  antibodies  present  giving  complete 
inhibition  of  precipitation.  This  is  taken  to  mean  that,  as  immunization 
continues,  the  antibodies  produced  are  directed  against  an  increasing  num- 
ber of  the  potential  antigenic  sites  on  a  protein.  In  the  example  quoted 
where  complete  inhibition  was  achieved  the  molar  ratio  required  for  this 
was  1-8  moles  inhibitor  to  1  mole  serum  albumin.  This  low  value  makes  it 
improbable  that  the  inhibition  is  competitive  and  other  evidence  such  as 
the  inability  of  excess  serum  albumin  to  reverse  the  inhibition  supports  this 
view.  This  low  ratio  is  also  in  contrast  to  the  results  of  Landsteiner  using 
haptenes  where  a  very  large  molar  excess  is  required  for  inhibition  of  pre- 
cipitation with  the  conjugated  antigen.  This  difference  may  arise  because 
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the  inhibitor  contains  the  whole  surface  of  the  combining  centre  which 
binds  more  firmly  to  the  antibody  site  than  the  smaller  haptenic  groups. 
The  importance  of  configuration  in  the  former  suggests  that  a  relatively 
large  surface  area  is  involved.  With  a  small  haptenic  group  the  secondary 
valence  forces  would  be  less  and  binding  correspondingly  less  firm. 

The  dimerization  of  the  inhibitor  had  no  influence  on  its  immunological 
behaviour  showing  that  the  surface  adjacent  to  the  sulphydryl  group,  which 
would  be  blocked  in  the  dimer,  could  not  be  a  part  of  the  combining  site. 
As  the  work  on  the  dimerization  of  the  whole  molecule  suggests  that  the 
sulphydryl  group  is  sunk  into  the  surface  and  is  not  readily  accessible,  it 
is  perhaps  not  surprising  that  it  is  not  part  of  a  combining  site.  Unless  an 
antigen  is  partly  broken  down  prior  to  its  participation  in  stimulating  anti- 
body formation,  it  would  be  expected  that  only  the  surface  of  the  molecule 
determines  the  complementary  character  of  the  antibody  combining  site. 
It  should  also  be  noted  that  the  part  of  the  inhibitor  surface  adjacent  to 
the  sulphydryl  group  could  play  an  essential  part  in  determining  the  con- 
figuration of  the  combining  centre,  though  not  itself  part  of  it. 

The  inhibitor  will  cause  anaphylactic  shock  when  injected  into  a  guinea 
pig  positively  immunized  with  rabbit  anti-bovine  serum  albumin  serum  and 
at  about  the  same  level  as  that  required  to  cause  shock  by  injection  of  the 
whole  antigen.  Landsteiner  and  Van  der  Scheer  (1933)  showed  that  azo  dye 
haptenes  would  cause  anaphylactic  shock  in  sensitized  guinea  pigs,  if  the 
dyes  used  had  two  haptenic  groups.  This  type  of  haptene  will  also  cause 
precipitation  of  antisera,  but  with  the  inhibitor  molecule  there  does  not 
appear  to  be  any  correlation  between  precipitating  power  and  ability  to 
cause  anaphylactic  shock. 

SUMMARY 

A  12,000  mol.  wt.  fragment  of  bovine  serum  albumin  has  been  isolated  from  a 
chymotryptic  digest. 

This  fragment  retained  the  power  to  combine  with  the  specific  antibody 
as  demonstrated  by  inhibition  and  precipitation  studies  and  also  by  its 
power  to  provoke  anaphylactic  shock  in  sensitized  guinea  pigs. 

The  fragment  readily  dimerizes  at  neutral  or  alkaline  reaction  and  evidence 
is  presented  which  suggests  that  this  dimerization  is  caused  by  interaction 
of  SH  groups,  probably  the  same  groups  which  lead  to  the  formation  of 
a  Hg  dimer  mercaptalbumin  with  the  original  molecule. 
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Biochemists  have  long  cherished  the  hope  that  the  phylogenetic  classifica- 
tion of  animals,  based  essentially  on  morphology,  would  find  a  correspond- 
ing chemical  expression  of  evolutionary  differences  between  animals.  The 
discussion  here  today  has  also  served  to  emphasize  the  relation  between 
species  specificity  and  protein  structure.  I  now  wish  to  present  some  recent 
data  which  illustrate  the  variation  in  proteins  that  occur  during  develop- 
mental changes  in  a  given  species. 

For  some  years  we  have  been  studying  the  chemical  changes  that  are 
associated  with  amphibian  metamorphosis.^  Recently,  in  collaboration  with 
Albert  Herner,  Lloyd  Fish,  and  Mrs  E.  J.  Casson  Lewis,  we  have  observed 
significant  alterations  in  serum  proteins  during  normal  and  induced  meta- 
morphosis.^ Marked  changes  in  both  the  amount  and  the  distribution  of 
the  serum  proteins  were  noted.  The  distribution  of  the  serum  proteins  was 
estimated  from  paper  electrophoresis  studies  of  serum  obtained  from  freshly 
clotted  tadpole  and  frog  blood. 

As  shown  in  Table  1,  the  Florida  swamp  frog  has  virtually  no  detectable 
protein  in  the  electrophoretic  fraction  corresponding  to  serum  albumin.  As 
metamorphosis  proceeds,  the  albumin  fraction  appears  and  increases  until 
a  normal  complement  of  45%  is  observed  for  the  frog  of  this  species.  The 
common  bullfrog  tadpole  has  a  small  fraction  of  serum  albumin  which  also 
increases  until  the  albumin  is  about  one-half  of  the  total  serum  protein. 
The  very  young  bullfrog  and  the  adult  bullfrog  possess  identical  propor- 
tions of  serum  albumin.  In  early  tadpoles  of  both  of  these  species,  the 
albumin  fraction  can  be  appreciably  increased  by  the  administration  of 
thyroid  hormone.  The  non-albumin  fraction  of  tadpole  serum  protein  seems 
to  be  predominantly  globulin  in  character. 

The  implications  of  these  observations  for  comparative  biochemistry  are 
too  numerous  and  lengthy  to  be  fully  discussed  here  and  will  be  presented  else- 
where. ^  We  think  that  it  is  not  unlikely  that  the  tadpole-frog  system  is 
reflecting  a  serum  protein  alteration  typical  of  the  change  from  aquatic  to 
terrestrial  forms. ^ 

Certain  subtle  changes  also  occur  in  the  hemoglobins  of  the  metamor- 
phosing bullfrog  tadpole  as  reported  previously  by  McCutcheon^    and  by 
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Table  1 

CHANGES    IN    SERUM    ALBUMIN    DURING    AMPHIBIAN 
METAMORPHOSIS 
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Florida  swamp  frog  (/?.  hechsherii) 
Common  bullfrog  {R.  catesbiand) 

„               „     injected  with  triiodothyronine 

Serum  albumin/total  serum 
protein 

Young  Tadpole 

Frog 

<002 
010  (±-03) 
0-20 

0-45 
0-49  (±05) 

Riggs.^  Our  laboratory  is  in  the  process  of  accumulating  sufficient  amounts 
of  these  various  blood  proteins  so  that  the  structural  basis  for  their  differences 
may  be  investigated.* 
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La  variabilité  allotypique 
de  certaines  protéines  du  serum 

JACQUES  OUDIN 

Laboratoire  d'Immunochimie  analytique,  Institut  Pasteur,  Paris 


Un  exemple,  connu  depuis  peu,  de  protéines  dont  la  structure  est  susceptible 
de  variations  individuelles  à  déterminisme  héréditaire,  est  celui  que  nous 
avons  signalé  récemment  chez  le  lapin. ^-^  Ces  variations  de  structure  se 
manifestent  par  des  différences  de  spécificité  antigénique,  d'un  individu  à 
l'autre  au  sein  d'une  même  espèce,  entre  des  protéines  qu'on  croyait  iden- 
tiques. Le  fait  qu'il  puisse  exister,  chez  deux  lapins,  deux  formes,  de 
spécificité  antigénique  différente — ou  allotypes — d'une  même  protéine, 
explique  qu'on  puisse  immuniser  l'un  des  deux  lapins  contre  une  ou  plusieurs 
protéines  du  sérum  de  l'autre. 

Le  procédé  d'immunisation  dont  nous  nous  sommes  servi  consiste  à  injecter 
à  un  lapin  selon  la  technique  décrite  par  Freund  et  ses  collaborateurs^  du 
précipité  spécifique  formé  par  la  réaction  de  l'immunsérum  d'un  autre  lapin 
avec  l'antigène  homologue.  La  nature  de  l'antigène  injecté  ne  semble  pas 
intervenir.  Lorsque  l'injection  entraine  la  formation  d'anticorps,  ceux-ci 
réagissent  avec  le  sérum  qui  a  servi  à  l'immunisation  et  avec  un  certain 
nombre  d'autres  serums  de  lapins.  La  quantité  d'anticorps  formés  peut 
être  telle  que  le  poids  d'azote  précipité  par  1  ml.  d'immunsérum  dépasse 
0,35  mg. 

La  nature  du  matériel  injecté  tendrait  à  suggérer  que  l'antigène  immuni- 
sant est  le  constituant  de  beaucoup  le  plus  abondant  dans  le  précipité  spéci- 
fique, c'est  à  dire  l'anticorps.  Nous  avons  pu  vérifier  dans  plusieurs  cas 
qu'il  en  était  ainsi.  Un  sérum  précipitable,  contenant  l'antigène,  était  un 
immunsérum  agglutinant  S,  Typhi  (suspension  0)  à  la  dilution  1/3200.  Le 
même  sérum  agglutinant  a  été  additionné  de  20  fois  son  volume  d'un  im- 
munsérum antilapin,  et  le  précipité  formé  éliminé  par  centrifugation.  Le 
sérum  anti  S.  Typhi  ainsi  débarassé  de  son  antigène  précipitable  par  l'im- 
munsérum antilapin  n'agglutinait  plus  S.  Typhi  qu'à  la  dilution  1/800.  Nous 
avons  observé  dans  des  conditions  analogues  la  diminution  du  titre  H  d'un 
autre  sérum  agglutinant  de  lapin  anti  S.  Typhi.  La  diminution  du  titre 
agglutinant  était  évidemment  due  dans  les  deux  cas  à  l'élimination  préalable 
d'une  partie  des  anticorps  anti  S.  Typhi  dans  la  réaction  où  ils  avaient  été 
précipités  en  tant  qu'antigène. 
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Il  n'est  nullement  nécessaire  que  le  sérum  précipitable  soit  celui  d'un 
animal  hyperimmunisé;  dans  la  plupart  des  réactions  effectuées,  l'antigène 
se  trouvait  dans  des  serums  de  lapins  normaux,  parfois  même  dans  des 
serums  de  lapins  nouveau-nés.  Ces  réactions  ont  été  effectuées  pour  une 
part  en  milieu  liquide;  mais  nous  avons  été  amené  à  utiliser  de  plus  en 
plus  la  précipitation  en  milieu  gélifié  selon  la  technique  de  diffusion  simple 
à  une  dimension,  en  tubes.*-^-^  Dans  les  réactions  de  plusieurs  serums,  con- 
tenant l'antigène,  avec  un  même  immunsérum,  cette  technique  permet  en 
effet  de  mieux  apprécier  l'égalité  ou  la  différence  d'intensité  de  la  réaction 
selon  la  densité  de  précipité  de  la  zone,  de  déceler,  même  à  intensité  égale, 
des  différences  d'aspect,  et  naturellement  de  démontrer,  par  la  multiplicité 
des  zones  de  précipitation,  que  plusieurs  antigènes  réagissent. 

Parmi  les  immunsérums,  choisis  après  un  grand  nombre  d'immunisations 
comme  donnant  avec  les  mêmes  serums  de  lapins  des  réactions  de  types 
différents,  les  quatre  immunsérums  dont  les  réactions  sont  représentées  au 
tableau  1  peuvent  servir  d'exemples  simples.  Ces  immunsérums  ne  donnent 
chacun,  en  effet,  qu'une  seule  zone  de  précipitation,  d'intensité  constante.  Le 

Tableau   1 

RÉACTIONS    DE    4    IMMUNSÉRUMS    DE    LAPINS    AVEC    50    SERUMS 
DE    LAPINS    NON    IMMUNISÉS 


Nombre  total 

Immunsérums 

Réactions  (gel)  de  50  serums  de 

de  réactions 

(Nos  des  lapins) 

lapins  non  immunisés 

positives 
avec  chaque 
immunsérum 

420 

+ 

+ 

+ 

+ 

+ 

+ 

o 

o 

48 

432 

+ 

+ 

+ 

o 

o 

o 

+ 

o 

22 

194 

+ 

o 

o 

+ 

o 

o 

+ 

+ 

15 

287 

0 

+ 

o 

o 

+ 

o 

o 

o 

7 

Nombre  de  serums  don- 

nant chaque  type  de 

réaction 

3 

3 

15 

10 

4 

13 

1 

1 

~ 

tableau  indique  donc  seulement,  par  +  pu  o,  la  présence  ou  l'absence  de  la  réac- 
tion manifestée  par  une  zone  de  précipitation.  On  peut  admettre  que  chacun 
de  ces  immunsérums  contient  des  anticorps  contre  une  structure  spécifique 
qui  se  trouve  être  présente  dans  chacun  des  serums  avec  lesquels  il  précipite, 
et  absente  des  autres  serums.  Il  est  commode  d'appeler  'motifs  allotypiques' 
les  structures  de  ce  genre,  sans  préjuger  de  leur  nature  chimique.  On  voit 
que  la  fréquence  des  motifs  allotypiques  responsables  des  réactions  du 
tableau  1  est  très  variable:  48/50,  22/50,  15/50,  7/50.  Pour  qu'une  réaction 
ait  lieu,  il  faut  que  le  motif  allotypique  qui  en  est  responsable  soit  présent 


300  JACQUES  OUDIN  [17 

dans  le  sérum  immunisant  et  non  dans  le  sérum  de  l'animal  immunisé. 
Aucune  réaction  ne  peut  évidemment  intéresser  un  motif  qui  est,  soit  présent 
dans  l'un  et  l'autre,  soit  absent  de  l'un  et  de  l'autre.  Lorsque  la  fréquence  d'un 
motif  allotypique  est  très  faible,  la  réalisation  de  ces  conditions,  nécessaires 
(mais  non  suffisantes)  à  l'apparition  d'anticorps,  devient  très  peu  probable. 
Il  en  est  de  même  pour  un  motif  de  fréquence  très  élevée.  Or  les  fréquences 
indiquées  au  tableau  1  nous  permettent,  au  prix  d'une  extrapolation  vrai- 
semblable, de  supposer  que  d'autres  motifs  allotypiques  existent,  soit  plus 
fréquents,  soit  moins  fréquents  que  ceux  dont  il  s'agit,  dans  la  population 
étudiée.  La  probabilité  que  ces  motifs  rares  ou  fréquents  échappent  aux 
recherches  peut  donc  être  très  grande.  D'autre  part,  l'impression  que  don- 
nent des  motifs  allotypiques  différents,  de  ne  pas  susciter  aussi  aisément  les 
uns  que  les  autres  la  formation  d'anticorps,  est  une  raison  supplémentaire 
de  supposer  que  les  motifs  allotypiques  décelés  ne  représentent  qu'une  très 
petite  partie  de  ceux  qui  existent. 

Si  l'on  répartit  les  individus  en  classes  telles  qu'à  l'intérieur  de  chaque 
classe  tous  les  serums  donnent  exactement  les  mêmes  réactions  avec  tous 
les  immunsérums  dont  on  dispose,  le  nombre  des  classes  théoriquement 
possibles  croît  exponentiellement  en  fonction  du  nombre  des  motifs  allo- 
typiques qu'on  est  capable  de  déceler.  Encore  faudrait-il,  pour  qu'on  puisse 
observer  réellement  le  nombre  ainsi  prévisible  de  classes  différentes  (à  sup- 
poser qu'on  examine  les  serums  d'un  nombre  suffisant  d'individus),  que  la 
présence  d'un  motif  n'entraine  nécessairement  ni  la  présence  ni  l'absence 
d'aucun  autre  motif  Dans  le  cas  où  la  présence  d'un  motif  entraînerait 
obligatoirement  la  présence  d'un  autre  motif  donné,  on  ne  pourrait  distinguer 
ces  deux  motifs  qu'à  la  condition  qu'ils  soient  portés  par  deux  molécules 
différentes.  La  réaction  en  milieu  gélifié  pourrait  alors  déceler  leur  réaction 
sous  forme  de  deux  zones  de  précipitation  distinctes,  mais  toujours  jumelées. 
Certaines  observations  que  nous  avons  faites  pourraient  s'expliquer  de  cette 
façon.  Dans  l'ensemble,  la  plupart  des  réactions  observées  décèlent  des 
motifs  qui  peuvent  exister  l'un  sans  l'autre.  Mais  il  semble  qu'il  y  ait  parfois, 
pour  deux  motifs  allotypiques  donnés,  une  certaine  tendance  à  se  trouver  plus 
souvent,  soit  présents  tous  deux  à  la  fois,  soit  même  présent  pour  l'un  et 
absent  pour  l'autre. 

Du  fait  que  deux  motifs  allotypiques  peuvent  exister  l'un  sans  l'autre, 
il  ne  s'ensuit  pas  nécessairement  qu'ils  soient  hérités  séparément.  L'expérience 
montre  qu'ils  peuvent  l'être:  le  tableau  2  donne  l'exemple  des  réactions  des 
lapins  de  deux  portées,  comparées  à  celles  des  parents.  Les  conclusions  qui 
peuvent  être  tirées  d'autre  part  des  croisements  effectués  sont  actuellement 
limitées  par  l'ignorance  où  nous  sommes  de  l'ascendance,  et  partant  du 
génotype,  des  parents. 

Rien  ne  permet  jusqu'à  présent  de  supposer  qu'un  état  pathologique 
quelconque  puisse  être  la  conséquence  des  variations  allotypiques  que  nous 
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Tableau  2 

RÉACTIONS    DES    SERUMS    DES    LAPINS    DE    DEUX    PORTÉES, 
ET    DES    SERUMS    DES    PARENTS 


Parents 

Portée  A 

Parents 

Portée  B 

Immunserums 

Nos 

10 

32 

60 

61 

62 

63 

64 

44 

52 

101 

102 

109 

6 

? 

? 

? 

? 

e 

S 

6 

? 

ê 

o" 

? 

420 

+ 

+ 

+ 

+ 

+ 

+ 

+ 

+ 

+ 

+ 

+ 

+ 

432 

o 

+ 

o 

+ 

+ 

o 

+ 

+ 

+ 

+ 

+ 

+ 

194 

o 

o 

o 

0 

o 

o 

o 

o 

+ 

o 

+ 

+ 

287 

+ 

o 

+ 

+ 

o 

+ 

0 

+ 

o 

+ 

+ 

o 

avons  observées,  comme  c'est  le  cas  pour  les  variations  individuelles  de 
structure  de  l'hémoglobine.'^ 

Il  est  vraisemblable  que  le  phénomène  dont  il  s'agit  pourra  être  observé 
dans  beaucoup  d'autres  espèces  que  le  lapin.  Nous  avons  pu  déjà  déceler, 
dans  les  mêmes  conditions  que  chez  le  lapin,  des  réactions  intraspécifiques 
analogues,  quoique  d'intensité  plus  faible,  chez  le  cobaye,  donnant  à  penser 
que  dans  cette  espèce  aussi  une  ou  plusieurs  protéines  du  sérum  sont  sus- 
ceptibles de  variabilité  allotypique. 
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Species  variation  and  structural  aspects 
in  some  pituitary  hormones 

CHOH  HAO  LI 

The  Hormone  Research  Laboratory  and  the  Department  of  Biochemistry, 
University  of  California,  Berkeley,  California 


The  pituitary  body  is  distinguished  from  all  the  other  endocrine  glands  by 
its  structural  division  into  three  main  parts.  The  posterior  lobe,  connected 
with  the  brain  by  the  infundibulum,  is  the  secretory  site  of  the  two  peptide 
hormones,  vasopressin  and  oxytocin.  The  intermediate  lobe,  which  controls 
the  dispersion  of  pigment  granules,  lies  between  the  anterior  lobe  and  the 
posterior  lobe.  It  is,  however,  the  glandular  or  anterior  lobe  that  is  respon- 
sible for  the  physiological  dominance  exercised  by  the  pituitary  over  the 
other  glands  of  internal  secretion.  Fig.  1  presents  a  diagrammatic  summary 
of  the  biological  properties  of  the  pituitary  hormones. 

Despite  the  intensive  investigations  into  the  various  biological  functions 
of  the  anterior  pituitary  which  have  been  carried  out  in  recent  years,  there 
is  as  yet  no  conclusive  evidence  for  attributing  the  functions  of  this  lobe  to 
any  new  hormone  other  than  the  six  already  established ;^-2  namely,  growth 
hormone  (GH,  or  somatotropin  (STH)),  corticotropin  (ACTH),  thyrotropin 
(TSH),  lactogenic  hormone  (luteotropin,  prolactin),  foUicle-stimulating  hor- 
mone (FSH),  and  interstitial-cell  stimulating  hormone  (ICSH,  or  luteinizing 
hormone  (LH)).  These  hormones  are  protein  or  peptide  in  nature,  and  three 
of  them  (TSH,  FSH,  and  ICSH)  appear  to  contain  carbohydrates  in  addi- 
tion to  amino  acids. ^  These  three  carbohydrate-containing  hormone  pro- 
teins have  not  been  isolated  in  pure  form  and  hence  no  investigations  of 
their  structure  have  been  undertaken. 

The  isolation,  structure  and  synthesis  of  the  posterior  pituitary  hormones 
have  recently  been  accomplished  by  du  Vigneaud  and  his  collaborators.^ 
This  brilhant  achievement  is  well  known  and  shall  not  be  discussed  here, 
except  to  note  that  an  interesting  species  difference  was  observed  between 
pig  and  beef  vasopressins,  the  former  peptide  hormone  containing  lysine 
instead  of  the  arginine  found  in  the  latter.  On  the  other  hand,  the  oxytocin 
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molecules  from  beef  and  pig  pituitaries  manifest  no  difference  with  respect 
to  amino  acid  composition. 

The  speculation  that  the  melanocyte-stimulating  hormone  (MSH,  inter- 
medin) of  the  intermediate  lobe  and  ACTH  might  be  identical^ -^  has  given 
a  new  impetus  to  the  studies  on  the  former  hormone.  This  has  led  to  recent 
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Fig.  1 .  A  diagrammatical  summary  of  the  biological  properties  of  pituitary  hormones. 
CL.= corpus  luteum. 

successes  in  the  isolation  and  structural  elucidation  of  the  hormone  from 
both  pig  and  beef  pituitary  glands.'  In  this  paper,  we  shall  discuss  some 
recent  data  derived  from  structural  investigations  of  MSH,  the  cortico- 
tropins, and  two  anterior  pituitary  hormones  (somatotropin  and  prolactin), 
especially  with  regard  to  species  variations. 


I.    MELANOCYTE-STIMULATING    HORMONE 
(MSH,    INTERMEDIN) 

In  1916,  Smith^  and  Allen^  independently  reported  a  striking  change  in  the 
pigmentation  of  the  tadpole  following  hypophysectomy.  According  to 
Smith, ^"-^i  the  removal  of  the  hypophysis  in  the  tadpole  is  followed  by  a 
decrease  in  the  amount  of  intracellular  melanin  as  well  as  in  the  number 
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of  the  epidermal  melanophores.  Subsequently,  Allen^-  and  Swingle^^  showed 
that  transplantation  of  the  intermediate  lobe  of  adult  frogs  into  normal  or 
hypophysectomized  tadpoles  induces  a  marked  expansion  of  the  melano- 
cytes. It  was  further  demonstrated^^ '^^  that  extracts  of  the  intermediate 
lobes  of  the  bovine  pituitary  exercised  the  most  marked  influence  on  the 
pigmentation  of  the  hypophysectomized  tadpole,  although  suspensions  of 
the  anterior  lobes  were  also  effective  in  this  regard.  These  earlier  investiga- 
tions have  left  no  doubt  that  the  pituitary  gland  possesses  a  principle  or 
principles  which  regulate  the  pigmentation  of  amphibia. 

The  main  sources  of  starting  material  for  the  purification  of  the  melanocyte- 
stimulating  hormone  have  been  the  posterior-intermediate  lobes  of  beef  and 
pig  pituitary  glands.  The  pioneer  studies  of  Hogben,  Zondek,  lores,  Stehle, 
Landgrebe,  and  their  collaborators,  have  shown  the  hormone  to  be  a  poly- 
peptide of  comparatively  low  molecular  weight.  Since  1955,  four  groups  of 
investigators^^-^^'^'-^^-^^-^"'^^-^^  have  published  methods  for  the  purification 
and  isolation  of  MSH  from  beef  and  pig  pituitary  glands.  All  the  pubhshed 
procedures  for  the  preparation  of  MSH  concentrates  from  either  pig  or  beef 
posterior-intermediate  pituitaries  incorporate  the  acetic  acid  extraction  tech- 
nique first  introduced  by  Kamm  et  al.^^  for  posterior  pituitary  hormones 
and  later  developed  by  Payne  et  al.^^  for  ACTH.  In  addition,  oxycellulose 
is  employed  to  adsorb  the  MSH  activity  from  the  acetic  acid  extract  accord- 
ing to  the  procedure  commonly  used  for  the  corticotropins. ^^  From  these 
MSH  concentrates,  highly  purified  or  pure  preparations  may  be  obtained 
with  zone  electrophoresis^^  or  countercurrent  distribution^^  or  a  combina- 
tion of  these  two  techniques. ^^•^''•^^•^^ 

Lee  and  Lerner^^  reported  the  presence  in  hog  pituitary  glands  of  two 
substances  possessing  MSH  activity;  they  observed  two  active  peaks  when 
pituitary  powder  prepared  from  either  posterior  or  anterior  lobes  was  sub- 
mitted to  countercurrent  distribution  in  a  2-butanol-0-5%  aqueous  tri- 
chloroacetic acid  system.  The  preparation  with  a  high  partition  coeflScient 
was  designated  as  a-MSH,  and  the  other  component  as  j3-MSH.  These 
investigators  beheve  that  a-MSH  represents  the  main  MSH  component  of 
the  hog  pituitary  gland  because  it  accounts  for  the  greatest  percentage  of 
the  total  MSH  activity.  They  ascribe  the  fact  that  other  workers  have  ob- 
tained jS-MSH  as  the  main  component  to  the  step  in  the  fractionation  pro- 
cedure of  Landgrebe  and  Mitchell^^*  involving  precipitation  with  acetone, 
during  which  they  postulate  that  the  a-MSH  is  separated  from  the  jS-MSH 
and  discarded  in  the  lyophilized  filtrate.  However,  none  of  the  other  investi- 
gators^^-^^-2°-^^  have  detected  the  presence  of  a-MSH  in  the  course  of  the 
purification  of  their  MSH  preparations. 

The  structure  of  the  pig  jS-MSH  has  been  elucidated  independently  in 

*  This  explanation  cannot  be  applied  to  the  preparation  of  Geschwind  and  Li  ;2i  in 
their  procedure  the  supernatant  fluid  was  lyophiHzed  instead  of  being  precipitated  with 
acetone. 
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two  laboratories. "'^^'^"^  The  data  obtained  by  Harris  and  Roos^"  and 
Geschwind  et  al.^^'^^  are  in  complete  agreement.  Using  techniques 
developed  for  structural  studies  on  insulin^^  and  on  the  corticotropins,^" 
both  groups  of  investigators  obtained  the  following  amino  acid  sequence 
for  iS-MSH: 

H-Asp.Glu.Gly.Pro.Tyr.Lys.Met.Glu.His.Phe.Arg.Try.Gly.Ser.Pro.Pro.Lys.Asp-OH 
1         2      3      4      5      6      7      8      9     10    11     12    13    14    15    16    17   18 

It  has  been  shown  that  neither  /3-aspartyl,  nor  y-glutamyl,  nor  e-amino 
linkages  occur  in  ^-MSH.^^  The  two  residues  of  aspartic  acid  that  occur  in 
the  peptide  occupy  the  A^-  and  C-terminal  positions  in  the  chain.  Hydro- 
lysis of  the  hormone  peptide  with  trypsin  and  chymotrypsin  has  been  found 
to  proceed  in  accordance  with  the  known  specificity  of  these  proteolytic 
enzymes,  with  the  one  exception  that  the  Lys. Asp  linkage  (positions  17-18) 
is  resistant  to  cleavage  by  trypsin. 

Although  preparations  of  active  MSH  extracts  from  bovine  pituitary  have 
been  reported,^^-^^  the  isolation  of  a  bovine  MSH  in  pure  form  has  only 
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Fig.  2.  Resolution  of  chymotryptic  digest  of  beef  /S-MSH  by 
paper  electrophoresis  in  y-collidine-acetic  acid  buffer  of  pH  7-0; 
11  v/cm. 
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recently  been  achieved. ^2*  The  complete  amino  acid  sequence  of  the  bovine 
MSH  has  now  also  been  elucidated  by  various  methods,  as  shown  in  Table  1. 
It  may  be  seen  that  the  structure  proposed  for  the  bovine  hormone  differs 
from  that  proposed  for  the  pig  jS-MSH  only  at  position  2,  where  a  seryl 

*  Very  recent  investigations  with  I.  I.  Geschwind  indicate  that  about  80%  of  the  MSH 
component  isolated  from  sheep  posterior-intermediate  pituitaries  is  identical  with  the 
bovine  j8-MSH  in  both  composition  and  the  amino  acid  sequence,  whilst  the  remaining 
active  material  is  identical  with  the  pig  hormone. 
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residue  replaces  a  glutamyl.  This  replacement  of  an  uncharged  residue 
(serine)  by  a  charged  one  (glutamic  acid)  has  not  to  our  knowledge  been 
encountered  previously  in  connection  with  species  variation  among  bio- 
logically active  peptides.  Because  of  this  structural  difference,  the  isoelectric 
point  of  the  beef  hormone  (pH  7-0)  is  higher  than  that  of  pig  ^-MSH  (pH 
5-8).  The  latter  was  found  to  migrate  more  slowly  on  starch  in  a  pyridine- 
acetate  buffer  of  pH  4-93  than  does  the  former  (Fig.  3).  Because  of  the 
presence  in  the  bovine  MSH  of  serine  at  position  2  instead  of  glutamic 
acid,  it  would  be  expected,  on  the  basis  of  the  known  specificity  of  leucine 
aminopeptidase  (LAP),^^  that  the  rate  of  release  of  amino  acids  from  this 
hormone  by  the  enzyme  would  be  considerably  slower  than  it  would  be 
from  the  pig  ^-MSH.  That  this  was  indeed  found  to  be  the  case  can  be  seen 
from  Fig.  4. 

According  to  the  //;  vitro  frog  skin  assay,^^  beef  MSH  possesses  a  con- 
siderably lower  potency  than  does  pig  ^-MSH,  It  will  be  of  interest  to  find 
out  whether  this  difference  in  activity  between  the  hormones  of  the  two 
species  is  due  mainly  to  the  difference  in  the  amino  acid  at  position  2  of 
the  sequence,  or  whether  other  factors,  such  as  secondary  structure,  are 
involved.  This  question  can  be  resolved  only  by  the  synthesis  of  the  two 
melanocyte-stimulating  peptides.*  Table  2  summarizes  some  physical  and 
chemical  characteristics  of  the  pig  and  beef  hormones. 

II.    ADRENOCORTICOTROPIC    HORMONE 
(ACTH,    CORTICOTROPIN) 

It  has  been  well  established  that  hypophysectomy,  in  almost  all  the  species 
that  have  been  studied,  brings  about  a  pronounced  atrophy  of  the  cortex 
of  the  adrenal  glands,  whereas  the  medulla  is  affected  scarcely  at  all.^-^^ 
On  the  other  hand,  the  implantation  of  anterior  pituitary  tissue  or  the  injec- 
tion of  pituitary  extracts  into  normal  animals  have  as  their  chief  effect  a 
marked  hypertrophy  of  the  adrenal  cortex.  Hence,  the  name  adrenocortico- 
tropic hormone  (ACTH)  has  been  employed  to  designate  the  active  prin- 
ciple in  pituitary  extracts  that  accomplishes  repair  of  adrenal-cortical  atrophy 
in  hypophysectomized  animals,^  and  in  1951  the  term  corticotropin  was 
adopted  as  the  formal  designation  for  preparations  possessing  ACTH 
activity.^^ 

From  unhydrolyzed  ACTH  concentrates,  three  groups  of  investigators  have 
succeeded  independently  in  isolating  the  active  peptide  in  pure  form.  In 
1953,  White,^^  in  the  Armour  Laboratories,  described  the  preparation  of 

*  For  consonance  with  the  pattern  of  terminology  employed  for  the  other  pituitary 
hormones,  it  was  proposed  that  the  melanocyte-stimulating  hormones  be  called  melano- 
tropinsJ  Following  the  precedent  set  by  du  Vigneaud'*  for  the  vasopressins,  the  beef  MSH 
could  be  differentiated  from  the  pig  j3-MSH  by  designating  the  former  seryl-melanotropin 
and  the  latter  glutamyl-melanotropin.  These  designations  could  be  abbreviated  as  seryl- 
JS-MSH  and  glutamyl- i3-MSH. 
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an  apparently  pure  peptide  from  pig  pituitary  glands,  which  they  designated 
Corticotropin-A.  The  following  year  Li  and  co-workers^^  at  the  University 
of  California  reported  the  isolation,  from  a  sheep  ACTH  concentrate  that 
had  not  been  submitted  to  peptic  digestion,  of  a  peptide  which  behaves  like 
a  pure  substance.  The  California  group  designated  this  unhydrolyzed  ACTH 
obtained  from  sheep  pituitaries  a-corticotropin,  since  some  of  its  chemical 
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Fig.  3.  Zone  electrophoresis  of  beef  /3-MSH  {à-A)  and  a  mixture  of  beef  jS-MSH  and 
pig  /S-MSH  (o-o)  on  starch;  pyridine-acetate  buffer  of  pH  4-83  and  1= 01  ;  200  v  for  24 
hours. 
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Fig.  4.  Action  of  leucine  aminopeptidase  on  beef  and  pig  jS-MSH;  0-6  mg.  en2yme/l-4 
mg.  peptide;  pH  8-1,  0-02  M  Tris  buffer  containing  0  002  M-Mg++  at  40°C. 
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and  physicochemical  characteristics  differ  from  those  described  for  cortico- 
tropin-A.  In  the  same  year,  BelF^  and  his  colleagues  of  the  American  Cyana- 
mid  Company  announced  the  purification  of  j8-corticotropin  from  ACTH 
concentrates  of  pig  pituitaries.  Since  both  the  Armour  and  American  Cyana- 
mid  groups  employed  unhydrolyzed  pig  ACTH  concentrates  as  their  starting 
material,  it  cannot  be  assumed  that  corticotropin-A  and  )8-corticotropin  are 
different  peptides.  The  isolation  of  a  polypeptide  possessing  adrenocortico- 
tropic activity  from  bovine  pituitaries  in  pure  form  has  only  recently  been 
described.  ^^  Physicochemical  properties  of  the  bovine  hormone  are  identical 
in  every  respect  with  that  of  the  sheep  a-corticotropin.  The  corticotropins 
from  all  three  species  are  made  up  of  39  amino  acids  in  a  single  polypeptide 
chain  with  serine  as  the  A^-terminal  amino  acid  and  phenylalanine  at  the 
C-terminus. 

The  complete  amino  acid  sequences  of  the  pig  corticotropin-A  and  of 
the  sheep  a-corticotropin  have  been  eludicated.^"-*^  Preliminary  structural 
investigations  on  the  beef  corticotropin*-  indicated  that  its  amino  acid 
sequence  is  identical  with  that  of  the  sheep  a- ACTH.*"  It  may  be  seen  in 
Table  3  that  the  complete  sequence  was  derived  from  the  identification  of 
over  40  different  peptide  fragments  obtained  by  chymotryptic,  tryptic,  peptic 
and  partial  acid  hydrolysis,  as  well  as  by  submitting  the  whole  molecule 
to  stepwise  degradation  by  the  phenylisothiocyanate  (PITC)  procedure  of 
Edman  and  to  kinetic  investigations  with  leucine  aminopeptidase  (LAP)  and 
carboxypeptidase,  and  from  A^-  and  C-terminal  residue  analyses  by  the 
fluorodinitrobenzene  (FDNB)  procedure  of  Sanger  and  hydrazinolysis  ac- 
cording to  the  procedure  of  Akabori,  respectively. 

There  are  two  amide  groups  in  sheep  a-corticotropin.  The  location  of 
one  of  these  groups  in  position  33  was  recently  established  in  the  course  of 
a  kinetic  investigation  carried  out  with  J.  Léonis  and  D.  Chung  on  the 
effect  of  chymotryptic  digestion  on  the  hormone  peptide.  The  location  of 
the  other  amide  is  yet  to  be  determined,  but  there  have  been  no  indications 
that  it  occurs  in  position  30  as  it  does  in  the  case  of  pig  corticotropin-A 
(Cyanamid).*"  In  this  connection,  it  is  of  interest  to  note  that  pig  cortico- 
tropin-A (Armour)  possesses  no  amides  at  all.^^  Moreover,  there  are  some 
variations,  reported  by  three  groups  of  investigators,*"  ^^-^^  with  respect  to 
the  amino  acids  in  positions  25-28,  as  shown  in  Table  4.  It  may  be  noted 
that  the  component  amino  acids  in  this  segment  are  the  same  in  all  cases; 
only  the  order  differs.  Moreover,  the  only  difference  in  amino  acid  composi- 
tion between  the  peptide  hormones  isolated  from  sheep  (beef)  and  pig  glands 
appears  to  be  one  more  serine  and  one  less  leucine  in  the  former,  ^•'-^^  owing 
to  an  . . .  Ala.Ser . . .  sequence  in  the  sheep  peptide  in  place  of  a  . . .  Leu.  Ala . . . 
sequence  in  the  pig  peptide  at  amino  acid  positions  31-32. 

Despite  their  differences  in  amino  acid  composition,  and  variations  in 
certain  portions  of  their  amino  acid  sequence,  both  sheep  a-corticotropin  and 
pig  corticotropin-A  possess  identical  specific  ACTH  activity.  Consequently, 
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it  may  be  inferred  that  those  portions  of  the  molecule  containing 
these  differences  are  not  essential  for  biological  potency.  Indeed,  limited 
hydrolysis  of  both  corticotropins  with  pepsin  releases  11  amino  acid  resi- 
dues from  the  C-terminus  without  loss  of  adrenal-stimulating  activity. '*^'*^'^^''*^ 
Further  hydrolysis  of  this  active  core  with  dilute  acid  did  not  result  in  any 
loss  of  ACTH  activity.^''  Recently,  the  synthesis  of  the  first  20  amino  acids 
from  the  A^-terminus  of  the  corticotropins  has  been  achieved  by  Boissonnas 


Table  4 

VARIATIONS  IN  AMINO  ACID   SEQUENCES   AMONG   DIFFERENT 
PREPARATIONS    OF    ACTH 


Preparation 

Species 

Residue  No. 

25    26    27    28    29    30 

31    32    33 

a-Corticotropin 
Corticotropin-A 

(Cyanamid) 
Corticotropin-A 

(Armour) 

sheep 
pig 
pig 

Ala.Gly.Glu.Asp.Asp.Glu 

Asp.Gly.Ala.Glu.Asp.Glu.NH2 

Gly.Ala.Glu.Asp.Asp.Glu 

Ala.Ser.Glu.NH2 

Leu.Ala.Glu 

Leu.Ala.Glu 

et  al.,^'^  and  the  synthetic  icosapeptide  has  been  demonstrated  to  possess 
ACTH  activity.  Thus,  both  the  sheep  and  pig  corticotropins  appear  to  have 
an  identical  active  core  that  is  responsible  for  their  hormonal  function. 

The  behavior  of  corticotropin-A  and  a-corticotropin  in  countercurrent 
distribution  reveals  marked  differences  in  the  partition  coefficients  of  these 
peptides  in  various  solvent  systems.  For  example,  the  partition  coefficient 
of  corticotropin-A  in  a  2-butanol/0-2%  TCA  system  is  1-82,^^  whereas  that 
of  a-corticotropin  in  the  same  system  is  O-Tl."*^  In  a  solvent  system  made 
up  of  6%  acetic  acid  containing  3-5%  NaCl/«-BuOH,  corticotropin-A  (Cyan- 
amid) has  a  K  value  of  0-95,^^  whereas  that  of  a-corticotropin  in  the  same 
system  is  less  than  O-l.*^  It  is  unhkely  that  these  differences  are  due  to 
variation  in  amide  content  among  the  various  corticotropin  preparations, 
since,  even  though  there  is  a  difference  of  2  amide  groups  between  two 
preparations  of  a-corticotropin,  they  cannot  be  differentiated  by  their  be- 
havior in  countercurrent  distribution  in  a  2-butanol-aqueous  TCA  system.*^ 

There  are  also  some  indications  that  corticotropin-A  and  a-corticotropin 
may  be  distinguished  from  each  other  according  to  their  behavior  in  enzy- 
matic hydrolysis.  The  peptide  fragment  Lys.Arg.Arg.Pro.Val.Lys  occurs  in 
the  tryptic  digest  of  corticotropin-A  (Cyanamid)*^  but  it  is  not  encountered 
in  similar  digests  of  a-corticotropin.^^  Moreover,  the  sheep  hormone  appears 
to  be  more  labile  toward  pepsin  than  is  the  pig  hormone.  It  is  not  improb- 
able that  two  polypeptides  possessing  similar  amino  acid  sequences  may 
differ  in  the  spatial  arrangement  (helices)  of  the  residues  which  compose  it, 
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and  yet  may  have  identical  biological  activity.  Table  5  summarizes  some 
properties  of  pig  corticotropin-A  and  sheep  a-corticotropin. 

Table  5 
SOME    PROPERTIES    OF    CORTICOTROPINS 


a-Corticotropin 

Corticotropin-A  (Pig) 

(Sheep) 

Molecular  weight 

Armour 

Cyanamid 

Analytical  data 

4541 

3500 

4567 

Ultracentrifuge 

5363* 

— 

4390t 

Isoelectric  point,  pH 

6-6 

7-8 

— 

Sedimentation  constant,  ^20 

0-73 

— 

0-54 

Diffusion  constant,  D^o  x  10'^ 

12-2 

— 

11-3 

Partition  coefficients 

2-butanol/0-2%TCA: 

0-72 

1-82 

— 

n-butanol/6%  HOAc-3-5%  NaCI: 

0 

— 

0-95 

ACTH  activity  (units/mg.) 

150 

125 

80-100 

MSH  activity  (units/mg.) 

1x105 

1-7x105 

X 

A  comparison  of  the  amino  acid  sequences  of  both  ^-MSH  and  the  bovine 
MSH  with  that  of  the  corticotropins  reveals  that  there  is  a  central  hepta- 
peptide  sequence  that  is  common  to  both  molecules  : 


Corticotropins  :        Ser, 
Melanotropins  :  . . .  Pro. 


Tyr. 
Tyr. 


Ser. 
Lys. 


Met.Glu.Hls.Phe.Arg.Try.Gly. 
Met.Glu.His.Phe.Arg.Try.Gly. 


Lys. 
Ser. 


Pro. 
Pro. 


Val. 
Pro. 


It  may  also  be  noted  that  were  it  not  for  a  transposition  of  the  order  of  the 
serines  and  lysines  on  either  side  of  the  heptapeptide  core,  the  area  of  iden- 
tity between  the  two  hormones  would  extend  to  11  amino  acids.  It  appears 
likely  that  the  presence  of  this  heptapeptide  sequence  in  the  corticotropins 
explains  their  intrinsic  melanocyte-stimulating  activity. ^°§  Thus,  it  can  be 
postulated  that  it  is  by  virtue  of  an  arrangement  of  a  different  sequence  of 
amino  acids  on  each  side  of  the  heptapeptide  core  that  the  molecule  acquires 
adrenal-stimulating  as  well  as  melanocyte-stimulating  activity. 

*  As  trichloroacetate. 

t  As  acetate. 

%  Minimum  effective  dose,  3-5  y/ 100  g.  frog. 

§  Melanocyte  stimulation  is  not  the  only  function  of  a- ACTH  in  the  absence  of  adrenal 
glands.  Recent  studies  of  Menkin"'"  have  shown  that  the  hormone  peptide  exerts  a  direct 
local  effect  of  repressing  in  an  inflamed  area  the  increased  capillary  permeability  in  adrena- 
lectomized  animals.  In  addition,  it  has  been  shown  that  the  adipokinetic  activity  of  a-ACTH 
can  be  demonstrated  in  adrenalectomized  mice  maintained  with  corticoid  therapy.^i 
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III.    LACTOGENIC    HORMONES    (PROLACTINS, 
LUTEOTROPINS) 

The  first  indication  that  a  hormone  of  the  anterior  pituitary  might  effect 
lactation  in  mammals  was  given  by  the  experiments  of  Strieker  and  Grueter^^ 
in  1928.  Several  years  later  grow^th,  to  a  macroscopically  observable  extent, 
of  the  crop  glands  of  the  pigeon  after  the  administration  of  pituitary  extract 
was  reported  by  Riddle  and  Braucher;^^  this  effect  in  the  bird  has  been 
shown  by  subsequent  work  to  be  due  to  the  same  factor  that  elicits  lacta- 
tion in  mammals.  A  number  of  terms  have  since  been  proposed  for  this  factor, 
including  lactogenic  hormone,  prolactin,  galactin,  mammotropin  and  lacto- 
gen. After  the  discovery  that  the  hormone  is  capable  of  maintaining  luteal 
function  in  hypophysectomized  rats,^*  the  term  luteotropin  was  also  intro- 
duced. 

Highly  purified  preparations  of  prolactin  have  been  obtained  from  both 
beef  and  sheep  pituitary  glands. ^'^^  Recently  a  new  simplified  procedure  has 
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Fig.  5.  Countercurrent  distribution  of  sheep  and  beef  prolactin  at  20°C  in  the  2-butanol- 
0-4%  aqueous  dichloroacetic  acid  system. 


been  described^^  for  the  isolation  of  sheep  prolactin;  the  same  method  has 
also  been  employed  for  the  preparation  of  prolactin  in  a  highly  purified 
form  from  bovine  pituitaries.^'  Earlier  observations  that  sheep  prolactin  can 
be  differentiated  from  the  beef  hormone  by  differences  in  solubility  beha- 
vior and  in  tyrosine  content,^  have  now  been  confirmed.^'  It  may  be  seen 
in  Fig.  5  that  the  partition  coefficient  of  beef  prolactin  is  higher  than  that 
of  the  sheep  hormone  in  the  2-butanol-0-4%  dichloroacetic  acid  system. 


d   -^  ■  ^' 
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This  appears  to  be  consistent  with  the  earlier  findings  of  Fleischer''^  that  in 
alcoholic  solution  the  bovine  hormone  is  more  soluble  than  the  ovine. 

Table  6  presents  a  summary  of  the  known  physical  and  chemical  pro- 
perties of  prolactin  from  beef  and  sheep  pituitary  glands.  The  tyrosine  con- 
tent as  determined  by  the  spectrophotometric  method  is  definitely  higher 

Table  6 

SOME    PHYSICAL   AND    CHEMICAL   PROPERTIES 

OF   PROLACTIN    FROM    SHEEP    AND    BEEF 

PITUITARY    GLANDS 


Physical  and  Chemical  Properties 

Sheep 

Beef 

Molecular  weight 

Sedimentation-diffusion 

24,200 

Osmotic  Pressure 

26,500 

26,000 

Analytical  data 

24,100 

— 

Diffusion  coefficient*  (D20) 

8-44x10-7 

— 

Sedimentation  constant*  (520,  w) 

2-19 

— 

Partial  Specific  Volume  (  F20) 

0-739 

— 

Isoelectric  pointf  (pH) 

5-73 

5-73 

Specific  Rotation 

-40-5° 

-40-5° 

Partition  Coefficient 

(2-butanol/0-35%  aqueous  tri- 

chloroacetic acid) 

1-58 

2-07 

Tyrosine,  %  (by  wt.) 

5-26 

6-62 

Tryptophan,  %  (by  wt.) 

1-69 

1-75 

Cystine,  residue /mole 

3 

3 

A'-terminal  amino  acid 

Threonine 

Threonine 

C-terminal  amino  acid 

none 

none 

for  the  beef  hormone  than  for  the  sheep  hormone.  Both  hormones  have 
a  molecular  weight  of  26,000  and  an  isoelectric  point  at  pH  5-73;  they  both 
consist  of  a  single  peptide  chain  with  211  amino  acid  residues^^  and  the 
sequence  Thr.Pro.Val.Thr.Pro . . .  at  the  TV-terminus.^"  Studies'  in  which 
prolactic  preparations  of  both  species  were  subjected  to  reaction  with  carb- 
oxypeptidase  and  to  hydrazinolysis,  have  given  no  indication  that  either 
hormone  possesses  a  C-terminal  residue  at  all.  Prolactin  that  has  undergone 
oxidation  with  performic  acid  has  been  found  to  behave  as  a  single  com- 
ponent in  electrophoresis,  sedimentation,  and  chromatography,  as  well  as 
according  to  the  results  of  A^-terminal  residue  analysis. ^^  From  these  data, 
as  well  as  from  the  absence  of  evidence  of  any  C-terminal  residue,  it  was 
postulated  tentatively  that  the  prolactin  molecule  has  a  cyclic  configuration 
at  the  C-terminus.'  This  tentative  structure  has  now  been  verified  by  the 

♦  Determined  in  0-1  M-NaHCOa  solutions. 

t  Estimated  in  acetate  buflfers  of  0  05  ionic  strength. 
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identification  of  cysteine  as  the  C-terminal  residue  after  reductive  cleavage 
of  the  disulfide  bridges*  of  the  hormone  protein.  ^^ 

In  earlier  investigations,^^  difficulties  were  encountered  in  attempting  to 
cleave  the  disulfide  bonds  in  the  hormone  by  reductive  procedures.  Even 
in  the  presence  of  6m  urea  at  pH  9-4,  the  disulfide  bridges  in  prolactin  are 
only  partially  reduced.^''  For  the  complete  reductive  cleavage  of  the  di- 
sulfide bonds,  the  following  procedure  has  been  shown  to  be  effective: 
100  mg.  of  sheep  or  beef  prolactin  were  dissolved  in  10  ml.  of  10m  urea 
containing  3%  mercaptoethanol,  and  4  ml.  of  cone.  NH4OH  were  added 
to  bring  the  pH  of  the  solution  to  9-8.  The  whole  solution  was  placed  in 
a  cellophane  bag  and  dialyzed  against  125  ml.  of  the  reducing  mixture  (i.e., 
10m  urea,  3%  mercaptoethanol,  pH  9-8)  in  a  glass-stoppered  cylinder.  After 
3  days,  the  contents  of  the  bag  were  placed  in  a  beaker  containing  1  g.  of 
iodoacetamide;  10  minutes  later,  the  whole  solution  was  dialyzed  against 
cold  running  tap  water.  After  2  hours  of  dialysis,  200  mg.  of  iodoacetamide 
were  added  to  the  bag  to  insure  the  complete  alkylation  of  the  — SH  groups. 
The  reduced  hormone  (prolactin  — SCH2CONH2)  solution  was  thoroughly 
dialyzed  and  lyophilized.  If  prolactin  — SH  was  desired,  the  reduced  solu- 
tion was  not  reacted  with  iodoacetamide,  but  instead  was  dialyzed  against 
3%  mercaptoethanol  for  5  days,  lyophilized  and  stored  in  vacuo. 

In  electrophoretic  experiments  on  starch,  prolactin  — SCH2CONH2  mig- 
rated essentially  as  a  single  component  with  a  mobility  higher  than  that  of 
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0.1  M-NOgCOj,  145  mg. 
48  hrs.,  Prolactin 


,  Native,  2  mg 
,  Reduced,  2  mg. 
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Fig.  6.  Zone  electrophoresis  on  starch  of  prolactin 
145  v,  48  hours. 


-S— CH2CONH2;  01  M-NaaCOa, 


the  native  hormone  in  O-lM-NagCOs  (Fig.  6).  In  no  case  was  there  separa- 
tion of  reduced  prolactins  into  more  than  one  component  after  the  reduc 
tive  cleavage  of  the  disulfide  bonds  in  the  hormone  protein.  It  was  further 

*  It  should  be  recalled  that  both  sheep  and  beef  lactogenic  hormones  have  3  residues 
of  cystine  per  mole  of  hormone  protein.^^  .59 
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noted  that  the  amount  of  material  originally  applied  to  the  starch  trough 
could  be  accounted  for  by  the  protein  recovered  from  the  peak. 

Biological  assays  of  prolactin  — SCH2CONH2  or  prolactin  — SH  accord- 
ing to  either  its  crop  sac-stimulating  or  its  luteotropic  activity  indicate  that 
reductive  cleavage  of  the  disulfide  bridges  causes  the  complete  destruction 
of  the  hormonal  activity,  indicating  that  the  integrity  of  the  disulfide  bridges 
is  essential  for  the  biological  function  of  lactogenic  hormones. 

Action  of  Corboxypeptidose  on  Sheep  Proloctm-S-CHgCONHj 
0.161- 


2  4 

Hours 


Fig.  7.  Action  of  carboxypeptidase  on  prolactin  — S — CH2CONH2;  enzyme-substrate 
tio,  1  :35;pH8  5at40°C. 


ratio 

When  either  prolactin  — SCH2CONH2  or  prolactin  — SH  was  allowed  to 
react  with  carboxypeptidase,  it  was  found  that  CyS — CH2CONH2  (or 
CySH),  asparagine,  and  leucine  are  released  successively  (Fig.  7).  The  libera- 
tion of  CyS — CH2CONH2  was  nearly  complete  in  4  hours  under  the  con- 
ditions of  digestion  used.  From  these  studies,  it  was  concluded  that  prolactin 
— SCH2CONH2  possesses  the  sequence  . .  .Leu.AspNH2.CyS — CH2CONH2 
at  the  C-terminus.  That  this  C-terminal  sequence  is  derived  from  reductive 
cleavage  of  a  disulfide  bridge  which  forms  a  loop  at  the  C-terminus  of  the 
untreated  prolactin  molecule  is  further  supported  by  the  identification^^  of 
C-terminal  cysteic  acid  in  performic-acid-oxidized  prolactins,  by  means  of 
the  hydrazinolysis  method  of  Akabori.  Hence,  it  may  be  concluded  that 
sheep  and  beef  prolactins  consist  of  a  single  polypeptide  chain  with  the 
sequence  Thr.Pro.Val.Thr.Pro  at  the  A^-terminus  and  with  an  intrachain  di- 
sulfide loop  at  the  C-terminus  (Fig.  8).  In  this  connection,  it  is  of  interest 
to  recall  that  both  insulin^^  and  the  posterior  pituitary  hormones*  have 
similar  intrachain  disulfide  loops,  involving  only  20-atom  rings.  There  re- 
mains to  be  investigated  the  size  of  the  loop  and  the  location  of  the  remaining 
two  disulfide  bridges  in  prolactin  molecules.  In  the  case  of  insulin,  the  only 
differences  between  pig,  sheep  and  beef  hormones  are  found  among  the 
amino  acid  residues  in  the  ring. 2*^  So  far  no  differences  have  been  found  in 
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the  amino  acids  which  are  released  by  carboxypeptidase  from  the  reduced 
form  of  either  beef  or  sheep  prolactin.^' 

H-Thr.  Pro.  Vol.  Thr.  Pro.tmi^LmmÊÊÊmiÊJLmm 


Fig.  8.  Proposed  gross  structure  for  the  prolactin  molecule.  (S — 8)2  indicates  schematic- 
ally the  existence  of  two  loops  containing  disulfide  bridges.  This  model  is  somewhat 
speculative  and  structures  other  than  that  shown  here  are  possible. 

IV.    GROWTH    HORMONES    (SOMATOTROPINS) 

The  relation  of  the  pituitary  gland  to  growth  was  first  given  experimental 
demonstration  by  Gushing  and  his  collaborators^*  in  their  classical  experi- 
ments on  the  hypophysectomy  of  dogs  in  1910,  and  by  1930  investigations 
with  various  species,  including  fish,  amphibia,  reptiles,  birds  and  mammals, 
had  left  little  doubt  that  the  anterior  pituitary  contains  a  hormone  which 
initiates  resumption  of  growth  in  hypophysectomized  animals.  During  the 
period  between  1944  and  1948,  methods  for  the  isolation  of  growth  hor- 
mone in  a  highly  purified  form  from  the  anterior  lobes  of  bovine  pituitaries 
were  described  by  two  groups  of  investigators.®^®*  During  the  past  few 
years,  the  many  attempts®' •®^-®^''°  that  have  been  made  to  purify  further 
the  products  isolated  by  the  pubUshed  procedures®^®®  have  not  met  with 
success.  The  failure  of  these  attempts  has  lent  support  to  the  belief  that  the 
protein  is  the  hormone.^"-'^ 

Bovine  Growth  Hormone.  The  bovine  somatotropin  is  a  protein  with  a 
molecular  weight  of  45,000  and  an  isoelectric  point  of  pH  6-85;  the  protein 
nitrogen  can  be  accounted  for  completely  by  its  amino  acid  and  amide 
content. ^°  The  hormone  protein  contains  a  total  of  396  amino  acids;  structur- 
ally, it  is  a  branched  polypeptide  chain  with  two  TV-terminal  residues  (phenyla- 
lanine and  alanine)  and  only  one  C-terminal  residue  (phenylalanine),*  as 
shown  in  Fig.  9. 

When  the  bovine  growth  hormone  from  which  two  residues  of  phenyla- 
lanine had  been  removed  by  the  action  of  carboxypeptidase  was  assayed 
for  biological  activity,  the  C-terminal  phenylalanine  was  found  not  to  be 
essential  for  the  biological  function  of  the  hormone.'^  Ghymotryptic  diges- 
tion, performed  at  a  temperature  of  25°,  in  a  buffer  of  pH  9-5  and  with  an 

*  Recent  work  with  H.  Papkoff  has  shown  that  growth  hormone  isolated  from  whole 
sheep  pituitary  glands  by  the  same  procedure  as  that  described  for  beef  glands  has  a 
similar  structure  to  that  of  the  bovine  somatotropin,  with  two  A^-terminal  residues  (phenyla- 
lanine and  alanine)  and  one  C-terminal  residue  (phenylalanine).  Its  molecular  weight  is 
47,800  and  its  isoelectric  point  is  at  pH  6-8. 
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enzyme/hormone  ratio  of  1/300,  did  not  result  in  inactivation  if  hydrolysis 
were  allowed  to  proceed  up  to  about  25%;  however,  the  biological  potency 
was  abolished  by  longer  digestion.  Thus,  it  should  be  noted  that  if  an  active 
digest  is  desired,  the  extent  of  hydrolysis  must  not  be  permitted  to  exceed 
30%.  The  non-protein  nitrogen  can  be  separated  from  the  whole  digest  either 
by  dialysis  or  by  treatment  with  a  5%  solution  of  trichloroacetic  acid.  It 
was  demonstrated  by  biological  assay  that  the  growth-promoting  activity 

Proposed  Structure  of  the  Beef  Growth-Hormone  Molecule 


Phe,  Ala.  -— — ^— ^^■— ■^— ^^  Ala  Phe,  Phe. 

(S-S)^ 
Alo,  Phe.  Ala.  ^— ^^^^^^-i— 


J 


Fig.  9.  Proposed  gross  structure  for  the  bovine  somatotropin  molecule.  This  model  is 
somewhat  speculative  and  structures  other  than  that  shown  here  are  possible. 

resides  in  the  remaining  components  (the  core),  which  are  non-dialyzable 
and  insoluble  in  the  trichloroacetic  acid  solution.'^ 

Boundary  electrophoresis  of  the  core  has  not  revealed  any  component 
that  behaves  like  the  untreated  bovine  hormone.  A^-Terminal  analysis  of  the 
core  reveals  a  number  of  new  residues  (threonine,  serine,  tyrosine,  lysine, 
etc.),  in  addition  to  phenylalanine  and  alanine  ;  if  we  assume  that  the  ter- 
minal phenylalanine  and  alanine  residues  have  come  from  the  undigested 
bovine  hormone,  it  can  be  estimated  that  the  native  hormone  in  the  core 
amounts  to  less  than  20%,  a  percentage  certainly  not  suflEicient  to  account 
for  the  biological  activity  of  the  core.  From  these  results,  it  may  be  con- 
cluded that  the  activity  does  not  depend  upon  the  integrity  of  the  bovine 
protein,  and  that  the  growth-promoting  activity  resides  in  a  center  (or 
centers)  of  activity  in  the  molecule. 

Human  and  Monkey  Growth  Hormones.  In  recent  years,  many  attempts 
have  been  made  to  apply  growth  hormone  clinically  to  humans  as  an  effec- 
tive therapeutic  agent,  but  these  attempts  have  met  with  disappointment.'* 
One  of  the  obvious  explanations  for  this  failure  is  that  the  bovine  growth 
hormone  is  chemically  different  from  the  hormone  derived  from  human 
glands.  Indeed,  it  has  been  shown  that  the  growth  hormone  prepared  from 
fish  pituitaries  is  active  in  fish,'^  but  not  in  rats;'^  and  that  monkeys  are 
not  responsive  to  bovine  growth  hormone,  whereas  they  are  responsive  to 
growth  hormone  prepared  from  pituitaries  of  their  own  species."'^  Our 
recent  investigations  with  human  and  monkey  pituitary  glands  indicate  that 
the  human  and  monkey  growth  hormones  are  similar  to  each  other  in  struc- 
ture and  properties,  but  that  they  both  differ  completely  from  the  hormone 
molecule  isolated  from  the  pituitaries  of  cattle.''^ -^^ 
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Table  7  outlines  the  procedure  for  the  isolation  of  growth  hormone  from 
either  monkey  or  human  pituitary  glands.'^"'' '^^  The  primate  somatotropins 
have  been  assayed  for  their  growth-promoting  activity  according  to  the 
stimulation  of  the  width  of  uncalcified  tibial  cartilage  in  hypophysectomized 
rats,  and  their  specific  activities  were  found  to  be  comparable  to  that  of  the 
bovine  hormone. '^'^^  As  is  to  be  expected,  human  and  monkey  growth  hor. 
mones  cause  a  marked  retention  of  nitrogen  and  phosphorous  in  patients 
with  mild  hypopituitarism  and  in  hypophysectomized  male  subjects.^^-^^ 

From  sedimentation  and  diffusion  data  and  from  computed  values  for 
partial  specific  volume,"^  the  molecular  weights  for  human  and  monkey 
somatotropins  were  calculated  to  be  27,100  and  25,400  respectively.  These 
values  differ  greatly  from  the  molecular  weight  of  the  bovine  hormone.^" 
Recently,  Ehrenberg  and  Heijkenskjöld^*  reported  similar  differences  between 
the  molecular  weights  of  human  and  bovine  growth  hormones. 

The  electrophoretic  mobihty  of  both  human  and  monkey  somatotropins 
as  a  function  of  pH  was  determined  in  buffers  of  0-1  ionic  strength,  and  it 
was  found  that  the  isoelectric  point  of  the  primate  hormones  is  more  acidic 
than  that  of  the  bovine  hormone.^"  Table  8  summarizes  some  of  the  physico- 
chemical  properties  of  somatotropins  isolated  from  monkey  and  human 
pituitary  glands.  It  may  be  seen  that  the  monkey  hormone  contains  four 
residues  of  cystine,  whereas  the  human  contains  only  two.  In  this  respect, 
the  monkey  hormone  is  more  like  the  bovine  somatotropin. 

Table  8 

PHYSICOCHEMICAL   CHARACTERISTICS    OF 

GROWTH  HORMONE  ISOLATED  FROM  HUMAN 

AND    MONKEY    PITUITARY    GLANDS 


Physicochemical  characteristics 

Monkey 

Human 

Sedimentation  constant  (^20.  to) 

1-88 

247 

Diffusion  constant  (DgoX  10'') 

7-20 

8-88 

Molecular  weight 

25,400 

27,100 

Isoelectric  point  (pH) 

5-5 

4.9 

Tyrosine  (Residues/mole) 

10 

8 

pKt* 

10-2 

9-8 

Tryptophan  (Residues/mole) 

1 

1 

Cystine  (Residues/mole) 

4 

2 

Methionine  (Residues/mole) 

6 

4 

Total  amino  acid  residues 

241 

245 

From  investigations  of  the  N-  and  C-termini  by  both  enzymic  and  chemi- 
cal methods,^" -^^  it  was  postulated  that  both  human  and  monkey  somato- 
tropins are  composed  of  a  single  polypeptide  chain  with  phenylalanine  as 

*  pK  of  phenolic  group. 
Wps 


322  CHOH  HAO  LI  [18 

both  C-  and  A^-terminal  residues.  Since  the  human  and  monkey  hormones 
have  two  and  four  cystine  residues^'*  respectively,  it  might  be  anticipated 
that  all  — S — S —  bridges  would  be  located  intramolecularly  along  the  poly- 
peptide chain.  This  was  found  to  be  the  case  when  performic  acid-oxidized 
somatotropins  were  submitted  to  electrophoresis  and  ultracentrifugation. 
Hence,  it  was  proposed^"  that  human  and  monkey  growth  hormone  have 
the  generalized  gross  structure  shown  in  Fig.  10. 

Proposed  Structure  of  the  Human  and  Monkey  Growth-Hormone  Molecule 


Monkey:         Phe  '  ^  (Ala,Gly)Phe 


r 


Human:  Phe  '  ■  Ala(Tyr,Leu)Phe 

Fig.  10.  Proposed  gross  structure  for  the  human  and  monkey  somatotropins.  This  model 
is  somewhat  speculative  and  structures  other  than  that  shown  here  are  possible. 

It  has  been  demonstrated^"  that  the  removal  of  C-terminal  phenylalanine 
from  either  human  or  monkey  somatotropin  by  the  action  of  carboxy- 
peptidase  does  not  result  in  any  loss  of  the  growth-promoting  activity.  It 
is  also  possible  to  show  that  limited  digestion  of  both  somatotropins  with 
chymotrypsin  does  not  lead  to  inactivation.^"  Human  somatotropin  was  not 
inactivated  by  the  action  of  chymotrypsin  only  if  hydrolysis  did  not  exceed 
10%;  longer  digestion  did  diminish  the  biological  activity.  On  the  other 
hand,  the  monkey  hormone  could  be  digested  to  19%  without  any  change 
in  the  growth-promoting  potency;  as  expected,  digestion  with  the  enzyme 
to  any  further  extent  was  found  to  result  in  the  loss  of  the  biological  activity. 
As  in  the  case  of  bovine  somatotropin,^''  these  results  seem  to  indicate  that 
the  integrity  of  the  intact  protein  is  not  essential  for  the  activity  of  the 
primate  growth  hormone. 

Whale  Somatotropin.  There  has  been  only  one  report  in  the  literature  in 
connection  with  growth-promoting  activity  of  extracts  of  whale  pituitaries. 
In  1934,  Valsö^^  reported  that  the  content  of  growth-promoting  activity  in 
the  anterior  pituitary  of  blue  whale  approached  that  found  in  bovine  pitui- 
taries. In  recent  investigations  with  H.  Papkoff,  growth  hormone  from 
anterior  lobes  of  whale  pituitary  glands*  has  been  successfully  isolated  in 
*  The  anterior  pituitaries  ranged  in  weight  from  6-40  g.,  the  average  being  about  21  g. 
The  whales  were  all  of  the  humpback  variety,  which  are  commonly  found  off  the  Cali- 
fornia coast  and  which  attain  an  average  length  at  maturity  of  around  40-50  feet.  The 
average  interval  post  mortem  before  removal  of  the  pituitary  was  about  20  hours.  Once 
removed,  the  gland  was  frozen  as  rapidly  as  possible  and  stored  at  — 15°  C.  until  used. 
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a  highly  purified  state;  this  whale  somatotropin  was  demonstrated  to  be 
structurally  more  similar  to  primate  hormones  than  to  sheep  or  beef  somato- 
tropin. 

The  procedure  employed  for  the  purification  and  isolation  of  whale  somato- 
tropin is  essentially  the  same  as  those  described  for  the  bovine^^  and  the 
primate'^ -^^  hormones.  Table  9  presents  a  summary  of  the  average  yield 

Table  9 

YIELD    AND    ACTIVITY    OF    FRACTIONS    OBTAINED    FROM    THE 
ISOLATION    PROCEDURE   OF   WHALE    GROWTH    HORMONE 


Fraction 

Amount 

Activity* 

Total  dose 

No.  of  rats 

Tibia  width 

Wet  weight  of  anterior  lobes 
Lyophilized  tissue 
0-2-0 -4  saturated  (NH4)2S04 
IRC-50  column  chromatography 
5-20%  ethanol  (somatotropin) 

(g.) 

95 

16 

3 

1 

0-3 

(mg.) 

0-40 
0-20 
0  04 
0  04 

4 

9 

8 

15 

(m) 

228  ±5 
232  ±3 
219±4 
238±3 

and  activity  of  fractions  obtained  from  each  step  of  the  isolation  procedure. 
The  pattern  obtained  when  the  somatotropin  concentrate  was  submitted  to  ion 
exchange  chromatography  can  be  seen  in  Fig.  1 1  ;  the  activity  was  found  in 
tubes  140  to  170.  The  purified  product  behaves  as  a  homogeneous  substance 
when  submitted  to  chromatographic  analysis  on  the  same  resin  (Fig.  11).  In 
addition,  no  evidence  of  heterogeneity  was  observed  in  the  ultracentrifuge  or 
in  free  boundary  electrophoresis.  Terminal  group  analyses  indicate  that  whale 
somatotropin  consists  of  one  TV-terminal  phenylalanine  and  one  C-terminal 
phenylalanine,  in  amounts  consistent  with  a  molecular  weight  of  39,900. 
The  isoelectric  point  has  been  determined  in  buffers  of  0-1  ionic  strength 
and  found  to  be  located  at  pH  6-2.  As  is  the  case  in  the  other  somatotropins, 
the  integrity  of  the  whole  protein  molecule  is  not  essential  for  the  growth- 
promoting  activity  of  whale  somatotropin  (Table  10). 

Highly  purified  preparations  of  growth  hormones  from  five  different 
mammalian  species  are  now  available  for  study  and  comparison.  Table  11 
summarizes  some  physicochemical  properties  of  these  five  somatotropins. 
The  molecular  weights  range  from  25,000  for  monkey  growth  hormone  to 
48,000  for  the  ovine  hormone.  Beef,  sheep  and  whale  growth  hormones  have 
isoelectric  points  at  pH  6-85,  6-8  and  6-2  respectively,  while  human  and 
monkey  somatotropins  are  more  acidic,  with  isoelectric  points  at  pH  4-9 
and  5-5  respectively.  Both  beef  and  sheep  growth  hormones  are  composed 

*  Total  dose  in  4  days;  tibia  width  in  mean  ±  S.E.;  tibia  width  for  the  controls,  160  /x. 
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100 


Fig.  11.  Upper:  Chromatography  on  a  cation  exchanger,  Amberlite  IRC-50  resin,  of 
whale  somatotropin  concentrate  (0-2-0 -4  SAS  fraction);  3  cm.  diameter  column  contain- 
ing 220  ml.  pH  5-1  buffer  applied  to  the  column;  15-5  ml. /tube. 

Lower:  Chromatography  on  a  cation  exchanger,  Amberlite  IRC-50  resin,  of  highly 
purified  whale  somatotropin;  0-9  cm.  diameter  column  containing  20  ml.  resin;  15  mg. 
protein  in  10  ml.  pH  5  1  buffer  applied  to  the  column;  arrows  indicate  same  sequence  of 
buffer  as  upper  diagram;  3  0  ml./tube. 


Table  10 


ACTION    OF   CHYMOTRYPSIN    ON 
WHALE    GROWTH    HORMONE 


Time  of 
digestion* 

Nonprotein 
nitrogen 

Bioassayt 

No.  of  rats 

Tibia  width 

(min.) 

0 

80 

150 

240 

(7o) 

0 

26 

38 
48 

7 
9 
9 
5 

230±3  + 
223  ±4 
214±6 
184±5 

*  pH  9-5,  enzyme/substrate,  1/300  (w/w),  25°C. 

t  A  total  dose  of  40  micrograms  in  4  days. 

%  Mean  ±  standard  error;  tibia  width  for  the  controls,  160  /*. 
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of  a  branched  polypeptide  chain  having  two  A^-terminal  residues  (phenyla- 
lanine and  alanine)  and  only  one  C-terminal  residue  (phenylalanine),  whereas 
whale,  monkey  and  human  somatotropins  are  made  up  of  only  one  poly- 
peptide chain  with  phenylalanine  as  both  the  A^-terminal  and  the  C-terminal 
residues.  As  is  evident  from  Table  12,  somatotropins  isolated  from  pituitary 
glands  of  various  species  can  easily  be  differentiated  by  the  A^-  and  C-terminal 

Table  11 

SOME    PHYSICOCHEMICAL   PROPERTIES    OF   VARIOUS 
SOMATOTROPINS 


Physicochemical 

Beef 

Sheep 

Whale 

Monkey 

Human 

characteristics* 

■^201  W 

3-19 

2-76 

2-84 

1-88 

2-47 

Z)20,t^Xl07 

7-23 

5-25 

6-56 

7-20 

8-88 

V 

0-76 

0-733 

0-737 

0-726 

0-732 

Molecular  wt. 

45,000 

47,800 

39,900 

25,400 

27,100 

f/fo 

1-31 

1-68 

1-45 

1-57 

1-23 

Pi 

6-85 

6-8 

6-2 

5-5 

4-9 

Cystine 

4 

5 

3 

4 

2 

A'^-temiinal  Residue 

Phe.Ala 

Phe,Ala 

Phe 

Phe 

Phe 

C-terminal  Residue 

Phe 

Phe 

Phe 

Phe 

Phe 

*  S20,  w^  S\  D20,  to  in  cm^/sec.;  V  in  cc./g.;  f/fo,  dissymmetry  constant;  Pi,  isoelectric 
point;  cystine  in  residues  per  mole. 

Table  12 

N-   AND    C-TERMINAL   SEQUENCES   OF 
SOMATOTROPINS    FROM   VARIOUS    SPECIES 


Somatotropins 

Terminal  sequences 

Amino* 

Carboxylf 

Beefî 

Sheep 

Whale 

Monkey 

Human 

Phe.Ala.Thr 

Ala.Phe.Ala 

Phe.... 
Ala 

Phe(?)Lys.... 
Phe(?)Thr.... 
Phe.Ser.Thr 

....Ala.Phe.Phe 

Tyr.Ala.Phe 

Leu.Ala.Phe 

. . .  .Ala.Gly.Phe 
Tyr.Leu.Phe 

•  By  the  phenylisothiocarbamyl  procedure  of  Edman;  the  quantitative  estimates  of  the 
A^-terminal  residue  were  obtained  by  Sanger's  fluorodinitrobenzene  procedure. 

t  As  revealed  by  the  action  of  carboxypeptidase;  the  C-terminal  residue  was  verified 
by  the  hydrazinolysis  procedure  of  Akabori. 

t  A^-terminal  amino  acid  sequences  were  established  by  identification  of  DNP-peptides 
in  partial  acid  hydroly sates  of  the  bovine  DNP-somato tropin. 
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sequences  in  the  polypeptide  chains.  Moreover,  recent  investigations  with 
T.  Hayashida  showed  that  whale,  monkey  and  human  somatotropins  are 
immunologically  distinct  from  the  bovine  hormone  (Table  13).  On  the  other 
hand,  bovine  somatotropin  cannot  be  differentiated  from  the  ovine  hor- 
mone by  the  precipitin  test. 

Table  13 

PRECIPITIN    TEST    OF   VARIOUS    GROWTH    HORMONES    WITH 
RABBIT    ANTI-SERUM*    TO    BOVINE    GROWTH    HORMONE 


Somatotropin 

Antigen,  mg. 

0-5 

0125 

0032 

0008 

0004 

0002 

0001 

Beef 

Sheep 

Whale 

Monkey 

Human 

+ 
+ 
0 
0 
0 

+ 
+ 
0 
0 
0 

+ 
+ 
0 
0 
0 

+ 
+ 
0 
0 
0 

+ 
+ 
0 
0 
0 

+ 
0 
0 
0 
0 

0 
0 
0 
0 
0 

All  somatotropins  could  be  hydrolyzed  partially  with  chymotrypsin  with- 
out inactivation.  The  extent  to  which  hydrolysis  of  these  five  somatotropins 
could  be  carried  out  with  no  loss  of  growth-promoting  activity  decreases 
in  the  following  order:  Bovine,  sheep,  whale  (25%)>  monkey  (20%)> human 
(10%).  Studies  on  the  rate  of  chymotryptic  digestion  of  these  hormones  of 
the  various  species  indicate  that  human  somatotropin  is  the  most  resistant 
to  the  enzyme  action  whereas  bovine  hormone  is  the  most  susceptible.^" 
These  differences  in  kinetic  behavior  in  the  course  of  enzymic  hydrolysis 
may  also  be  taken  as  an  indication  of  dissimilarity  in  composition  and 
structure.  These  observations  provide  an  additional  instance  of  the  fact  that 
somatotropin  molecules  from  various  species  differ  completely  from  one  an- 
other with  respect  to  chemical  and  structural  characteristics.  The  hormones 
from  all  species  are  equally  active,  however,  in  promoting  growth  in  rats, 
whereas  only  monkey  and  human  hormones  are  effective  in  man.f  Does 
this  mean  that  the  rat  is  capable  of  degrading  the  bovine  or  ovine  hormone 
to  the  active  core,  whereas  humans  do  not  possess  this  ability?  If  this  were 
true,  one  might  then  assume  that  all  somatotropins  possess  an  identical  core 

*  Anti-serum  to  bovine  growth  hormone  was  obtained  by  immunizing  rabbits  as  fol- 
lows: Rabbits  were  injected  subcutaneously  with  5  mg.  antigen  in  1  ml.  adjuvant  (Bayol- 
Arlacel)  and  5  mg.  antigen  in  1  ml.  saline  once  weekly  for  3  weeks.  After  no  injections  for 
3  weeks,  the  same  injection  schedule  was  followed  for  another  3  weeks.  After  no  injections 
for  another  3  weeks,  the  same  injection  schedule  was  followed  for  only  two  weeks.  After 
no  injections  for  another  3  weeks,  the  rabbits  were  bled  by  cardiac  puncture.  For  pre- 
cipitin tests,  0-1  ml.  anti-serum  was  employed  together  with  0-5  ml.  saline  containing  the 
antigen. 

f  Whether  or  not  whale  somatotropin  exercises  growth-promoting  activity  in  man 
remains  to  be  investigated.  From  structural  considerations,  the  whale  hormone  might 
be  expected  to  be  active  in  human  subjects. 
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representing  the  structural  center  responsible  for  the  growth-promoting 
activity. 
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Intervention  sur  l'exposé  du  Professeur  Li 

A  propos  des  groupes  C-terminaux  de  l'hormone  de 
croissance  et  de  l'hormone  lactogène 

M.  JUTISZ 

Laboratoire  de  Morphologie  Expérimentale  et  Endocrinologie, 
Collège  de  France 


Le  Professeur  C.  H.  Li  m'a  demandé  d'apporter,  à  la  fin  de  son  exposé, 
quelques  précisions  au  sujet  d'un  travail  que  Mlle  D.  M.  Meyer  et  moi- 
même  avons  fait  en  collaboration  avec  lui.^  Il  s'agit  de  la  détermination 
des  groupes  C-terminaux  de  l'hormone  de  croissance  et  de  l'hormone  lac- 
togène par  réduction  au  moyen  de  l'hydrure  double  de  lithium  et  d'aluminium. 

Nous  avons  utilisé  dans  ces  recherches  le  procédé  pubHé  par  Fromageot 
et  Coll.  en  1950.^  Ce  procédé,  je  le  rappelle  brièvement,  consiste  à  réduire 
les  groupes  C-terminaux  des  protéines  et  des  peptides  estérifiés  ou  non  en 
alcools  primaires  et  à  isoler,  après  l'hydrolyse  complète  du  produit  réduit, 
les  aminoalcools  correspondants. 

Cette  méthode  a  l'inconvénient  de  ne  pas  toujours  respecter  l'intégrité  du 
produit  traité  et  de  conduire  à  des  ruptures  de  certaines  liaisons  peptidiques 
particulièrement  fragiles. ^•^•*  C'est  donc  pour  nous  entourer  de  quelques 
garanties  que  nous  avons  contrôlé,  à  chaque  stade  de  l'opération,  les  groupe- 
ments N-terminaux  des  produits  obtenus,  par  la  méthode  de  Sanger. 

Hormone  de  croissance  (somatotropine).  Cette  hormone  a  été  estérifiée  par 
la  méthode  de  Fischer  et  Speier,^  dans  le  methanol  chlorhydrique  0,1  N, 
à  25°  C,  pendant  24  heures. 

Les  résultats  de  détermination  des  groupes  N-terminaux  et  C-terminaux 
à  des  différents  stades  du  traitement  de  l'hormone  ont  été  rassemblés  dans 
le  tableau  1. 

On  peut  tirer  de  l'examen  du  tableau  1  les  conclusions  suivantes  : 

1)  la  réduction  de  l'hormone  native  à  35°  C  fait  apparaître  un  groupement 
N-terminal  nouveau  du  glycocolle,  en  plus  de  deux  résidus  d'alanine  et  de 
phenylalanine,  existant  à  l'origine  dans  la  molécule; 

2)  l'estérification  de  l'hormone  native  à  25°  C  engendre  également  un 
groupement  N-terminal  supplémentaire  du  glycocolle; 

3)  la  réduction  de  l'hormone  estérifiée  donne  naissance  à  1,8  résidus  du 
glycocolle; 

4)  en  ce  qui  concerne  les  groupes  C-terminaux,  la  réduction  de  l'hormone 
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Tableau  1 

LES    CHIFFRES    EXPRIMENT  LES    RÉSIDUS    DES    DNP-AMINOACIDES 
OU    DES    AMINOALCOOLS    PAR   47000   G    DE    SOMATOTROPINE 


Résidus  N-terminaux: 

Aminoalcools: 

Somatotropine  native 

(après  correction) 

(après  correction) 

DNP-alanine 

1,00 

DNP-phénylalanine 

1,02 

Somatotropine  native 

DNP-alanine 

1,13 

colamine              0,94 

réduite  à  35°  C 

DNP-phénylalanine 

1,13 

phénylalaninol     traces 

DNP-glycocolle 

1,00 

Somatotropine  ester 

DNP-alanine 

0,92 

(25°  C,  pendant  24  H) 

DNP-phénylalanine 

0,98 

DNP-glycocolle 

0,87 

Somatotropine  ester 

DNP-alanine 

0,85 

colamine              1,50 

réduite  à  35°  C 

DNP-phénylalanine 

1,00 

phénylalaninol     0,86 

DNP-glycocolle 

1,80 

native  fait  apparaître  0,9  résidu  de  colamine  et  seulement  des  traces  de 
phénylalaninol  ; 

5)  on  trouve,  dans  l'hormone  estérifiée  réduite,  1,5  résidus  de  colamine 
et  0,86  résidu  de  phénylalaninol. 

Il  semble  logique  d'admettre,  à  la  lumière  de  ces  résultats,  que  les  résidus 
de  colamine  proviennent  d'une  coupure  de  la  liaison  Gly-Gly  dans  la  chaîne 
peptidique  de  l'hormone,  puisqu'ils  ont  leurs  correspondants  sous  forme  de 
glycocolle  du  côté  aminé,  et  que  0,86  résidu  de  phénylalaninol  est  dû  à  la 
réduction  du  groupe  C-terminal  estérifiée  de  phenylalanine. 

Hormone  lactogène  (prolactine).  Le  procédé  utihsé  dans  le  cas  de  cette 
hormone  a  été  le  même  que  précédemment,  sauf  pour  l'estérification,  qui 
a  été  poursuivie  pendant  70  heures.^  Les  résultats  obtenus  sont  donnés  dans 
le  tableau  2. 

Tableau  2 

LES    CHIFFRES    EXPRIMENT  LES    RÉSIDUS    DES    DNP- 
AMINOACIDES    OU    DES    AMINOALCOOLS    PAR    26-550 
G    DE    PROLACTINE 


Prolactine  native 
Prolactine  ester 

(25°  C,  pendant  70  H) 
Prolactine  ester 

réduite  à  35°  C 


Résidus  N-terminaux 
(sans  correction) 


DNP-thréonine 
DNP-thréonine 

DNP-thréonine 
DNP-X 


0,46 
0,57 

0,58 
0,31 


Aminoalcools  : 
(sans  correction) 


colamine     0,27 
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11  apparaît  d'après  ces  résultats  que: 

1)  l'estérification  ne  provoque  pas  de  coupures  dans  la  chaîne  peptidique 
de  l'hormone  lactogène,  contrairement  au  cas  de  la  somatotropine; 

2)  au  cours  de  la  réduction  de  la  prolactine  estérifiée  il  y  a  apparition  de 
quantités  à  peu  près  équivalentes  de  DNP-aminoacide  non  identifié,  et  de 
colamine;  ces  deux  produits  provenant  probablement  d'une  dégradation 
partielle  de  la  molécule. 

On  peut  admettre,  suivant  ces  résultats,  que  la  prolactine  ne  possède  pas 
de  groupement  C-terminal  libre,  ou  bien  que  ce  groupement,  s'il  en  existe 
un,  soit  résistant  à  la  réduction  par  les  moyens  utilisés. 
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The  structure  and  activity  of  melanocyte- 
stimulating  and  adrenocorticotropic  peptides 

J.  lEUAN  HARRIS* 

Department  of  Biochemistry,  University  of  Cambridge 


The  isolation  of  highly  purified  melanocyte  stimulating  substances  from  pig 
posterior  pituitary  extracts  has  been  reported  from  several  different  labora- 
tories during  the  past  two  years.  (Lerner  and  Lee,  1955;  Lee  and  Lerner, 
1956;  Porath,  Roos,  Landgrebe  and  Mitchell,  1955;  Benfey  and  Purvis, 
1955;  Geschwind,  Li  and  Bamafi,  1956.) 

The  material  isolated  by  Lerner  and  Lee  (1955)  was  found  to  be  a  highly 
basic  polypeptide  with  an  isoelectric  point  of  11 -0-1 1-5,  whilst  both  Porath 
et  al.  (1955)  and  Geschwind  et  al.  (1956)  reported  isoelectric  points  of  5-2 
and  5-8,  respectively,  for  the  melanocyte  stimulating  hormone  (MSH)  pre- 
parations which  they  had  isolated.  This  apparent  discrepancy  was  later  re- 
solved when  Lee  and  Lerner  (1956)  showed  that  both  forms  of  the  hormone 
were  present  in  the  same  extracts  of  pig  posterior  pituitaries.  Consequently 
they  proposed  that  the  basic  polypeptide  which  they  had  originally  isolated 
be  called  a-melanocyte-stimulating  hormone  (a-MSH),  and  that  the  acidic 
polypeptide  hormone  isolated  in  the  other  laboratories  be  called  ^-melanocyte 
stimulating  hormone  (j3-MSH). 

THE    iS-MELANGCYTE-STIMULATING    HORMONE 

(iS-MSH) 

The  complete  structure  of  j8-MSH  was  elucidated  by  Harris  and  Roos 
(1956)  using  material  prepared  by  Roos  according  to  the  isolation  procedure 
of  Porath  et  al.  (1955).  The  amino  acid  sequence  of  the  peptide  hormone 
was  deduced  from  the  results  of  'stepwise'  degradation  studies  on  the  intact 
molecule  and  from  the  characterization  of  peptide  fragments  isolated  after 
digestion  with  trypsin  and  chymotrypsin,  as  shown  in  Fig.  1.  In  this  way 
iS-MSH  was  shown  to  consist  of  eighteen  amino  acid  residues  in  a  single 
polypeptide  chain,  and  to  contain  a  sequence  of  seven  amino  acids, 
Met.Glu.His.Phe.Arg.Try.Gly.  (positions  7-13),  which  had  previously  been 
shown  to  occur  in  the  corticotropins  (e.g.  Bell,  1954);  in  addition,  the  tyro- 
sine and  proline  residues,  in  positions  5  and  15,  respectively,  also  occur  in 
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corresponding  positions  in  the  corticotropin  molecule.  The  structural  simi- 
larity between  j3-MSH  and  corticotropin  is  of  particular  interest  since  it 
provides  a  chemical  basis  for  the  observation  (Bell,  1954;  Dixon,  1956a) 
that  pure  corticotropin  preparations  possess  intrinsic  melanocyte  stimulating 
activity  in  addition  to  their  primary  adrenocorticotropic  action. 
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Fig.  1.  Amino-acid  sequence  of  /3-meIanocyte-stimulating  hormone  (pig  /3-MSH); 
arrows  indicate  points  of  cleavage  by  trypsin  (T)  and  chymotrypsin  (C). 

Geschwind,  Li  and  Barnafi  (1956,  1957a)  have  reported  similar  conclu- 
sions concerning  the  melanocyte  stimulating  peptide  which  they  isolated 
from  pig  posterior  pituitary  extracts,  showing  that  it  was  identical  in  all 
respects  with  the  material  isolated  by  Porath  et  al.  (1955)  and  characterized 
by  Roos  (1956).  Geschwind,  Li  and  Barnafi  (1957Z>)  have  also  reported  the 
isolation  and  structure  of  a  jS-type  melanocyte-stimulating  hormone  from 
ox  pituitaries;  j8-MSH  (ox)  was  found  to  differ  from  j8-MSH  (pig)  only  in 
position  2  (Fig.  1),  where  the  glutamic  acid  residue  in  pig  j8-MSH  is  replaced 
by  a  serine  residue  in  the  ox  hormone. 

THE    a-MELANOCYTE-STIMULATING    HORMONE 

(a-MSH) 

The  structure  of  a-MSH  has  been  determined  by  Harris  and  Lerner  (1957), 
using  material  prepared  by  Dr  Teh  Lee  according  to  the  procedure  described 
by  Lee  and  Lerner  (1956).  Although  it  did  not  contain  either  a  free  TV-terminal 
or  C-terminal  group,  the  amino  acid  sequence  of  a-MSH  could  be  deduced 
from  the  structure  of  peptide  fragments  derived  from  it  by  digestion  with 
trypsin  and  chymotrypsin.  The  results  are  summarized  in  Fig.  2. 
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Fig.  2.  Amino-acid  sequence  of  a-melanocyte-stimulating  hormone  (pig  a-MSH); 
arrows  indicate  points  of  cleavage  by  trypsin  (T)  and  chymotrypsin  (C). 

a-MSH  is  shown  to  consist  of  thirteen  amino  acid  residues  in  a  single 
peptide  chain,  and  the  amino  acid  sequence  is  identical  with  the  A^-terminal 
tridecapeptide  sequence  in  corticotropin.  However,  it  differs  from  the  re- 
lated portion  of  the  corticotropin  molecule  in  that  the  A^-terminal  residue. 
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serine,  occurs  as  an  jV-substituted  derivative,  whilst  the  C-terminal  residue, 
vahne,  occurs  as  the  amide.  The  TV-terminal  moiety  in  a-MSH  has  been 
provisionally  identified  as  TV-acetylserine,  but  additional  confirmatory  ex- 
periments are  still  in  progress.*  The  presence  of  these  additional  substituents 
provides  an  explanation,  both  for  its  highly  basic  iso-electric  point  and  for 
its  unexpected  solubility  in  acetone  (Lee  and  Lerner,  1956). 

STRUCTURAL    AND    ACTIVITY    RELATIONSHIPS 

BETWEEN    THE    a-    AND    jS-MELANOCYTE 

STIMULATING    HORMONES    AND    THE 

CORTICOTROPINS 

The  amino  acid  sequences  of  the  two  melanocyte-stimulating  hormones, 
together  with  the  related  A^-terminal  segment  of  the  corticotropins,  are  illus- 
trated in  Fig.  3.  The  heptapeptide,  Met.Glu.His.Phe.Arg.Try.Gly,  is  common 


CORTICOTROPIN 

1 

1    1 
H-'Ser. 

2 

Tyr. 

3 
Ser. 

e<-MSH(plg) 

1    1 
R-[Ser 

1 

2 
Tyr. 

3 
Ser. 

/â-MSH(plg)    H- 

1       2    3     4 
Asp.GlujGly.Pro. 

5 
Tyr. 

6 
Lys 

4      5     6     7      8      9     IG 
M2t.Glu.Hls.Phe.Arq.Try.  Gly. 


4     S     6     7      8      9      10 
Mci.GluHis.Phe.Arg.  Try.  Gly 


7       8    9     ID      II       12     13 
M«t.Glu.Hls.Phe.  Arg.  Try  Gly 


IS 
Pro. 


-  1 
13   I  14 
Val.iGly--- 

I 

I 


13  1 
Val|-NHa 


39 
Phe  —  OH 


16     17      18 
PrûLys.  Asp- OH 


Fig.  3.  Amino-acid  sequences  of  the  a,  and  j3-melanocyte-stimulating  hormones,  and 
the  related  TV-terminal  segment  of  corticotropin,  from  pig  pituitary  glands. 

to  all  three  molecules,  and  it  is  only  the  interchange  of  lysine  and  serine 
residues  (positions  6  and  14  respectively)  in  ^-MSH  which  prevents  the 
common  sequence  from  extending  to  eleven  amino  acids. 

All  three  molecules  possess  melanocyte-stimulating  activity,  but,  whereas 
a-MSH  and  ^-MSH  appear  to  have  approximately  comparable  potencies, 
corticotropin  is  of  the  order  of  a  hundred  times  less  active  (Lee  and  Lerner, 
1956)  than  a-MSH  when  assayed  according  to  the  method  of  Shizume, 
Lerner  and  Fitzpatrick  (1954). 

Shepherd  et  al.  (1956)  showed  that  active  degradation  products  of 
/3-corticotropin  (e.g.  the  A^-terminal  fragment  containing  24  amino  acid 
residues)  retained  the  same  ratio  of  melanocyte-stimulating  to  adreno- 
corticotropic activity  as  undegraded  j3-corticotropin  (39  amino  acid  residues), 
and  this  important  observation  was  in  itself  a  strong  indication  that  the 
essential  requirements  for  melanocyte-stimulating  activity  were  to  be  found 
within  the  smallest  active  corticotropin  molecule.  The  amino  acid  sequence 

*  The  presence  of  A^-acetylserine  as  the  A^-terminal  group  in  a-MSH  has  now  been 
confirmed.  When  peptide  CHi  (Fig.  2)  was  subjected  to  hydrazinolysis  both  acetylhydrazide 
and  serine  hydrazide  were  formed  and  were  identified  as  described  by  Fraenkel-Conrat  and 
Narita  (p.  260).— Ed. 
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of  a-MSH  now  reveals  that  the  structure  essential  for  melanocyte-stimulating 
activity  must  reside  v^^ithin  the  A^-terminal  tridecapeptide  fragment  of 
corticotropin  ;  and  from  a  comparison  of  the  structures  of  the  two  melanocyte- 
stimulating  hormones  it  would  appear  that  the  structure  essential  for 
biological  activity  is  in  both  cases  to  be  found  within  a  sequence  of  eleven  amino 
acid  residues  (i.e.  positions  2-12  in  a-MSH,  and  5-15  in  j3-MSH),  since  the  iV- 
terminal  tetrapeptide  and  C-terminal  tripeptide  sequences  in  jS-MSH  can 
be  replaced  by  the  A^-substituted  serine,  and  valine  amide  moieties,  respec- 
tively, in  a-MSH. 

Although  the  entire  amino  acid  sequence  of  a-MSH  is  contained  within 
the  corticotropin  molecule,  its  actual  melanocyte-stimulating  activity  is  con- 
siderably less  than  that  of  a-MSH.  This  suggests  that  the  additional  struc- 
tural features  which  are  necessary  for  adrenocorticotropic  activity  may  at 
the  same  time  inhibit  the  potential  melanocyte-stimulating  properties  of  the 
corticotropin  molecule.  For  example,  the  A^-terminal  serine  residue  has  been 
shown  to  be  essential  for  the  adrenocorticotropic  activity,  but  not  for  the 
melanocyte-stimulating  activity,  of  corticotropin  (Dixon,  1956/?).  a-MSH, 
on  the  other  hand,  in  which  the  A^-terminal  serine  occurs  as  an  N-substituted 
derivative,  acquires  enhanced  melanocyte-stimulating  activity,  and  it  could 
be  inferred  that  the  unsubstituted  CH2 — CH —  structure  may,  at  least  in 

I  I 

OH     NH2 

part,   be  responsible  for  the  reduced  melanocyte-stimulating  activity  of 

corticotropin.  In  accord  with  this  hypothesis,  it  has  now  been  shown  (Dixon 

and  Lerner,  unpublished  results)  that  when  the  TV-terminal  serine  residue  in 

corticotropin  is  oxidized  with  periodate,  loss  of  adrenocorticotropic  activity 

is  accompanied  by  a  significant  increase  in  the  melanocyte-stimulating  activity 

of  the  periodate  modified  corticotropin.  Similarly,  Shepherd  et  al.  (1956) 

have  shown  that  when  corticotropin  is  subjected  to  mild  alkaline  hydrolysis 

(which  results  in  the  cleavage  of  peptide  bonds  in  the  A^-terminal  Ser.Tyr.Ser. 

structure),  loss  of  adrenocorticotropic  activity  is  again  accompanied  by  an 

increase  in  melanocyte-stimulating  activity. 

Although  a  conclusive  evaluation  of  these  results  must  await  the  isolation 
and  characterization  of  the  periodate  and  alkali  degradation  products  of 
corticotropin,  it  is  clear  that  corticotropin  can  be  changed  into  a  pre- 
dominantly melanocyte-stimulating  substance,  and  that  the  A^-terminal  serine 
plays  an  important  role  in  determining  which  of  the  two  biological  properties 
shall  predominate. 

The  other  outstanding  question  concerns  the  nature  of  the  smallest  active 
adrenocorticotropic  peptide.  It  is  already  clear  that,  in  addition  to  the 
common  heptapeptide  sequence,  a  free  A^-terminal  serine,  and  in  all  prob- 
ability the  entire  Ser.Tyr.Ser  sequence,  is  essential  for  adrenocorticotropic 
activity.  Preliminary  studies  on  the  selective  removal  of  the  //-terminal  sub- 
stituent in  a-MSH  show  that  the  unsubstituted  tridecapeptide   does  not 
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possess  adrenocorticotropic  activity.  This  indicates  that  a  specific  sequence 
of  an  as  yet  undetermined  number  of  amino  acids  must  also  be  added  to 
the  C-terminal  residue  of  the  common  sequence  in  order  to  produce  an 
active  adrenocorticotropic  peptide. 
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Some  observations  on  the  structural  basis 
of  enolase  activity 

BO  G.  MALMSTRÖM 

Institute  of  Biochemistry,  Uppsala,  Sweden 


Studies  on  pepsin,^  ribonuclease,'^-*  lysozyme,^  and  papain^  have  demon- 
strated that  the  intact  protein  molecule  is  not  necessary  for  the  catalytic 
activity  of  these  enzymes.  Recent  experiments  with  yeast  enolase  have  shown 
that  this  enzyme  also  can  be  considerably  modified  without  affecting  the 
enzymic  activity.  It  has  been  found'  that  the  crystaUine  enzyme,  prepared 
according  to  Warburg  and  Christian,®  contains  varying  amounts  of  several 
active  forms  (enolase  a,  b,  and  c),  characterized  by  different  electrophoretic 
mobilities.  The  b  and  c  enzymes  are  beheved  to  arise  from  enolase  a  by 
proteolytic  action  during  the  yeast  autolysis  [see  Malmström'],  and  this  has 
prompted  a  study  of  the  effect  of  limited  proteolysis  on  the  activity  of 
enolase.  Some  preliminary  results  of  these  experiments,  together  with  a 
number  of  other  observations  on  the  structural  basis  of  enolase  activity, 
will  be  reported  in  the  present  communication. 

The  enzyme  was  purified  by  zone  electrophoresis  and  specific  adsorption 
on  to  the  Mg++  form  of  a  cation  exchanger,  sulphomethyl  cellulose;'  this 
last  step  takes  advantage  of  the  high  affinity  between  enolase  and  its  acti- 
vating ions.  By  the  new  purification  procedure  it  is  possible  to  prepare  gram 
quantities  of  enolase  relatively  easily.  The  method  also  allows  separation 
of  the  different  active  forms  of  the  enzyme,  and  purified  enolase  a  shows 
the  presence  of  only  one  component  both  in  zone  electrophoresis  and  in 
ultracentrifuge  experiments  over  a  wide  range  of  conditions. 

Qualitative  end-group  studies  have  been  made.  Edman's  method^  was 
used  for  the  A^-terminal  residues,  and  DFP  (diisopropyl  fluorphosphate) 
treated  carboxypeptidase^"  for  the  C-terminal  groups.  No  differences  in 
either  N-  or  C-terminal  amino  acids  could  be  found  between  enolase  a  and 
by  showing  that  the  difference  between  these  forms  is  not  in  the  end  groups. 

The  effect  of  digestion  with  trypsin,  carboxypeptidase  and  aminopeptidase 
on  the  activity  of  the  enzyme  has  been  studied.  The  trypsin  and  carboxy- 
peptidase used  were  commercial  preparations  (Worthington),  while  amino- 
peptidase  was  prepared  according  to  Smith  and  Hill  [personal  communication 
(see  also  Ref.^^)].  Both  exopeptidases  were  treated  with  DFP  prior  to  use. 
Trypsin  digestion  led  rather  rapidly  to  a  considerable  drop  in  activity,  and 
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no  attempts  to  isolate  active  fragments  have  as  yet  been  made.  However, 
with  both  carboxypeptidasc  and  aminopcptidase  a  large  number  of  amino 
acids  can  be  removed  from  the  enolase  molecule  without  changing  the 
activity.  Table  1  summarizes  some  results  on  the  liberation  of  amino  acids 

Table  1 

THE   LIBERATION    OF   AMINO    ACIDS    FROM 

ENOLASE  ON  DIGESTION  WITH  EXOPEPTIDASES 

AT    pH    8-5    AND    22° 


Time  of  incubation 
(hr.) 

Leucine  equivalents/mole  of  enolase 

Carboxypeptidasc 

Aminopcptidase 

0-6 

3-1 

10-3 

22-5 

7 
33 
67 
80 

13 
47 
73 
92 

as  a  function  of  time  of  incubation  with  these  two  enzymes.  In  both  cases, 
the  molar  ratio  between  enolase  and  exopeptidase  was  30  :  1.  The  incuba- 
tions were  carried  out  at  pH  8-5;  aliquots  of  the  incubation  mixtures  were 
taken  out  at  diiferent  times  and  the  reaction  terminated  by  precipitating 
the  proteins  with  trichloroacetic  acid  to  a  final  concentration  of  5  per  cent. 
Amino  acids  were  determined  by  the  ninhydrin  method  of  Moore  and 
Stein,^-  and  the  results,  expressed  as  leucine  equivalents,  have  been  cor- 
rected for  small  amounts  of  ninhydrin-positive  material  Hberated  also  in 
the  absence  of  added  peptidase.  It  is  apparent  that  both  peptidases  liberate 
close  to  100  residues  in  about  24  hours.  In  connection  with  the  rather  ex- 
tensive degradation  achieved  with  carboxypeptidasc,  it  is  of  interest  to  note 
that  enolase  contains  no  proline.  The  removal  of  this  number  of  amino 
acids  from  either  the  C-  or  A^-terminal  end  causes  no  appreciable  loss  of 
activity.  The  digestion  with  carboxypeptidasc  was  found  to  cause  a  large 
change  in  the  electrophoretic  mobility  of  the  enzyme. 

The  experiments  described  show  that  about  17  per  cent  of  the  enolase 
molecule  is  not  essential  for  maintaining  the  active  site.  This  is  similar  to 
the  amount  of  digestion  achieved  with  ribonuclease  without  loss  of  activity,^ 
but  considerably  less  than  in  the  case  of  papain,^  where  close  to  70  per  cent 
of  the  molecule  can  be  removed. 

While  limited  removal  of  the  N-  and  C-terminal  chains  does  not  affect 
the  activity,  the  specific  steric  configuration  of  the  active  site  appears  de- 
pendent on  non-covalent  cross-hnkages,  since  the  enzyme  is  completely 
inactive  in  8  m  urea.  This  is  in  contrast  to  findings  with  ribonuclease^  but 
similar  to  the  behavior  of  many  other  enzymes. ^^  With  enolase,  the  unfold- 
ing is  reversible  during  the  first  fev/  hours  of  incubation  with  urea,  since  the 
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activity  is  regained  on  dilution.  However,  after  24  hours  in  urea  solution 
the  inactivation  is  irreversible. 

The  active  enolase  molecule  remaining  after  proteolytic  digestion  still 
contains  about  500  amino  acid  residues,  which  is  much  too  large  to  allow 
structural  studies.  Thus,  as  with  most  other  enzymes,  information  about  the 
active  site  is  of  an  indirect  nature.  The  influence  of  pH^^  and  solvent  com- 
position^^ on  the  kinetics  and  chelating  properties  implicates  the  involve- 
ment of  histidine  both  in  metal  binding  and  proton  transfer.  However,  it 
is  hoped  that  a  more  direct  chemical  attack  will  be  possible  by  utiUzing  the 
strong  binding  of  metal  ions  to  enolase  compared  with  most  proteins  and 
peptides.  ^^  It  is  possible  that,  even  after  the  activity  has  disappeared,  the 
chelating  site  is  left  intact,  and  attempts  to  isolate  metal-binding  fragments 
after  trypsin  digestion  are  now  in  progress. 

I  wish  to  thank  Professor  A.  Tiselius  for  his  stimulating  interest  and  helpful 
discussions.  Mr.  O.  Nylander  has  cooperated  in  most  of  the  experimental  work, 
and  a  detailed  report  will  later  be  published  jointly  with  him. 
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Insulin  crystals:  zinc  atoms  in  the  unit-cell, 
nucleation,  growth  and  shape 

J0RGEN   SCHLICHTKRULL 

Novo  Terapeutisk  Laboratorium,  Copenhagen,  Denmark 


The  number  of  metal  atoms  (Zn,  Cd,  Co  or  Ni)  present  in  the  unit-cell 
(M^ca.  35,000)  of  rhombohedral  insulin  crystals  is  an  important  detail  in 
the  structural  concept.  The  various  values  reported  for  metal  contents  in 
the  literature  are  quite  confusing  in  contrast  to  the  original  ngure,^-^-^  of 
3  atoms/cell,  which  is  also  the  number  of  dimer  elements  in  the  cell.  Since 
the  metal  content  of  insulin  crystals  is  actually  subject  to  gross  variations* 
depending  upon  many  conditions,  attempts  have  been  made  to  determine 
the  number  of  zinc  atoms/unit  cell  which  is  most  relevant  in  structural 
considerations. 

A  quantitative  examination  showed  some  interesting  features  of  the  role 
played  by  zinc  or  the  other  metal  ions  in  the  crystallization.  Insulin  was 
crystallized  with  zinc  in  a  phosphate  buffer  and  recrystallized  in  ammonium 
acetate.  The  crystals  contained  3  atoms  of  Zn/unit-cell  (0-51%  Zn),  in  per- 
fect agreement  with  Scott  and  Fisher's  experimental  results. ^-^  However, 
when  the  crystals  were  recrystallized  once  more,  the  zinc  content  dropped 
to  2  atoms/cell  and  remained  constant  at  that  value  through  several  sub- 
sequent recrystallizations  in  ammonium  acetate.  If  insulin  is  recrystallized 
from  a  citrate  buffer  containing  zinc  ions — which  are  strongly  complexed  to 
the  citrate — then  the  crystals  contain  2  atoms  of  zinc/unit-cell. ^  When  such 
crystals  are  recrystallized  from  a  citrate  buffer  without  the  addition  of  zinc 
ions,  the  new  crystals  are  again  found  to  contain  2  atoms/unit-cell,  but  in 
the  course  of  several  such  recrystallizations  the  content  decreases  a  little 
and  at  the  same  time  the  crystals  deteriorate  in  shape  and  amorphous  par- 
ticles appear  in  the  precipitates.  This  slight  decrease  in  the  zinc  content  is 
not  observed  with  recrystallizations  from  ammonium  acetate  and  can  pre- 
sumably be  accounted  for  by  competition  between  insulin  and  citrate  with 
respect  to  the  binding  of  zinc  ions.  It  is  noteworthy  that  the  zinc  content  of 
the  citrate-recrystallized  insulin  (2  atoms/unit-cell)  is  not  dependent  on  the 
pH  in  the  interval  over  which  crystallization  takes  place  (5*5<pH<7).  It 
is  concluded  from  these  experiments  that  among  the  many  combinations 
between  zinc  and  insulin  crystals,  the  combination  containing  2  atoms/unit- 
cell  is  a  very  strong  complex  and  represents  the  minimum  content  of  zinc 
ions  sufficient  for  crystallization. 
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This  value,  2  atoms/unit-cell,  is  also  significant  in  quite  other  investiga- 
tions, viz.  in  the  ultracentrifugal  studies  of  Cunningham,  Fischer  and  Vestling 
on  insulin  solutions.^  These  authors  found  that  the  sedimentation  constant 
was  independent  of  the  zinc  content  when  the  zinc  concentration  varied  from 
0'2  to  about  0-6-0 -8  moles  of  zinc  per  12,000  grams  of  insulin.  However, 
a  rise  in  the  zinc  concentration  above  the  latter  value  brought  about  a  linear 
rise  in  the  sedimentation  constant.  The  minimum  amount  sufficient  for 
crystallization,  2  atoms  of  Zn/unit-cell,  is  equivalent  to  0-7  moles  of  zinc 
per  12,000  grams  of  insulin  and  it  is  therefore  likely  that  the  sudden  rise  in 
the  sedimentation  constant  at  this  value  corresponds  to  the  fact  that  the 
condition  for  crystallization  is  reached  at  this  Zn  content. 

The  shape  of  the  insulin  crystals  depends  on  the  species.'  Recrystallized 
pig-insulin  crystals  are  perfectly  shaped  single  rhombohedrons,  but  beef 
insulin  crystals  are  twin-like  bodies  of  a  very  different,  star-Hke  appearance. 
(It  should  be  noted  that  the  internal  structure  is  rhombohedral  in  all  cases 
considered  here.)  If  beef-insulin  crystals  are  recrystallized  in  one  of  the 
usual  buffers  and  sodium  chloride  is  added  to  the  buffer,  it  can  be  observed 
that  the  shape  of  the  crystals  formed  depends  on  the  concentration  of  sodium 
chloride.  If  the  concentration  of  sodium  chloride  exceeds  approx.  6%,  the 
crystals  attain  the  perfect  rhombohedral  shape  characteristic  of  pig-insulin. 
This  rather  abrupt  change  in  shape  is  associated  with  a  change  in  the  mini- 
mum amount  of  zinc  ions  sufficient  for  crystallization.  If  the  buffer  contains 
so  much  sodium  chloride,  e.g.  7%,  that  the  shape  of  beef  insulin  crystals 
becomes  rhombohedral,  the  minimum  adequate  zinc  concentration — and 
this  applies  also  to  pig-insulin — is  no  longer  2  but  4  atoms/unit-cell.  Other 
experiments  have  shown  that  the  chloride  ion  is  responsible  for  these  trans- 
formations and  that  the  other  halogen  ions,  with  the  exception  of  fluoride, 
have  the  same  effect.  Nitrate,  sulphate  and  phosphate  proved  to  have  no 
effect.  The  transformations  are  not  contingent  upon  the  sodium  ions  since 
the  latter  can  be  replaced  by  potassium,  ammonium  and  calcium. 

Microscopical,  goniometric  measurements  on  a  large  (1-3  mm.)  perfect 
insulin  rhombohedron'  gave  a  value  (114°  22' ±4'  (S.E.))  which  was  in  agree- 
ment with  the  X-ray  value^  (1 14°  16')  for  the  unit-cell  in  wet  insulin  crystals. 

The  kinetics  of  insulin  crystalhzation  have  been  studied  in  detail.  It  was 
found  that  the  star-shaped  or  twin-like  beef-insulin  crystals  have  a  tendency 
to  form  monodisperse  suspensions.  It  means  that  the  nuclei  are  formed 
almost  exclusively  in  the  initial  stage  of  crystallization  where  the  insulin 
concentration  is  greatest.  The  sharp  and  perfect  pig-insulin  rhombohedrons 
and  the  identically  shaped  beef-insulin  crystals  formed  in  sahne  are  very 
polydisperse.  The  rate  of  nucleation  was  deduced  from  the  very  peculiar 
observation  that  the  ratios  between  the  total  volume,  total  surface,  total 
length  and  number  of  the  crystals  remain  constant  in  the  course  of  crystalliza- 
tion from  the  moment  when  the  first  few  crystals  had  formed.  It  is  possible 
to  account  for  the  nucleation  in  the  most  simple  way  if — and  apparently 
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only  if— the  ad  hoc  hypothesis  is  made  that  during  crystallization  the  crystals 
throw  off  nuclei  or  crystal  fractions  at  a  rate  proportional  to  their  surface 
area  and  to  their  hnear  growth  rate.  This  hypothesis  finds  support  in  the 
microscopical  appearance  of  the  suspension,  since  the  very  small  (<4/x) 
crystals  are  predominantly  found  on  the  vertices  or  faces  of  the  bigger  crys- 
tals. It  was  furthermore  observed  that  the  nucleation  is  enhanced  if  the 
surface  of  the  crystallization  system  is  increased,  and  suppressed  by  Tween-80 
or  by  a  surface  layer  of  paraffin.  The  linear  rate  of  deposition  of  insulin  on 
the  crystal  faces  was  approx.  1  /x/minute  at  the  maximum.  It  was  shown  to 
be  a  function  of  the  insulin  concentration.  In  the  absence  of  added  halo- 
genide,  it  was  proportional  to  the  third  power  of  the  supersaturation  and 
in  7%  NaCl  to  the  second  power. 

The  deposition  of  insulin  on  the  various  crystal  faces  was  studied  by 
letting  the  crystals  first  grow  in  a  buffer  containing  Indian  ink  and  then 
in  a  non-coloured  buffer.  The  Indian  ink  became  incorporated  in  the  crystal 
bodies,  and  after  some  further  growth  in  the  colourless  medium,  it  was 
observed  that  insulin  is  only  deposited  on  the  three  faces  meeting  in  one  of 
the  obtuse  vertices.  The  same  phenomenon  was  also  observed  in  seeding 
experiments  with  non-coloured  crystals.  Since  the  unit-cells  are  identically 
orientated  in  the  crystals,  an  'up  and  down'  with  respect  to  growth  in  the 
unit-cell  can  be  concluded. 

If  it  is  true  that  the  rate  of  deposition  on  a  crystal  face  is  the  same  on  the 
mirror  image  of  that  face,  then  it  can  be  concluded  from  these  observations 
that  the  unit-cell  of  the  rhombohedral  insulin  crystals  has  no  center  of 
symmetry.  This  conclusion  is  in  agreement  with  the  lack  of  centro  symmetry 
in  the  unit-cell  of  the  rhombohedral  insulin  crystal. 
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Peptides — cont. 
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tobacco  mosaic  virus,  251,  290 
trypsinogen,  171 
configuration,  25-62 
homogeneity,  93-99,  140-141,  144, 
183,  262-263,  304,  308,  314, 
316,  323-325 
immunological  criteria,  1 84,  240- 
244 
purification, 
chromatography,    96-1Û3,    197, 

199,  241-243,  324 
countercurrent  distribution,  304, 

314 
electrophoresis,  94-97,  140-141, 
291,  307-308,  316 
species  specificity,  66-75,  144-147, 

305-327 
structure,  genetic  control  of,  148- 

151,  247,  262-263,  298-301 
surface,  polarity  of,  43 
PyrroUdone  carboxyl  peptides,  268 


Random  coils,  in  denatured  proteins, 
48-59 
in  polyelectrolytes,  35-38,  47-49 
Ribonuclease 
action  of  carboxypeptidase  on,  233 
chymotrypsin  on,  212,  219-220 
pepsin  on,  233-235 
subtilisin  on,  229,  233,  236 
trypsin  on,  211-220 
amino  acid  sequence,  213,  230 
carbobenzoxylation,  224 
dialysis  rate,  108,  110 
disulphide  bonds,  oxidation,  211, 
219-221,  229,  236 
reduction,  224-228,  230-232,  236 
immunological  heterogeneity,  241- 
246 
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Rîbonuclease — cont. 
inhibition  by  alkaline  pH,  236 
antibodies,  244,  246 
esterification,  236 
guanidine  salts,  236,  237 
oxidation,  236,  238 
pepsin  digestion,  233,  235,  238 
reduction,  230-232 
optical  rotation,  234,  235 
oxidized,  deuterium  exchange,  32 
dialysis  rate,  108,  1 10 
enzymic  hydrolysis  of,  211-215, 
219,  227 
reduced   and   carboxymethylated, 
225-227,  230-232 
enzymic  hydrolysis  of,  227,  232 
viscosity,  38,  49,  58,  60,  234-235 
Ribonucleic  acid,  234,  250,  256,  262, 
272 

^Salt  links'  in  proteins,  42-43 
Sedimentation  coefficients,  see  under 
individual  proteins,  physico- 
chemical  properties 
Serum  albumin 
amino  acid  composition,  292 
species  variations  in,  73,  79 
configuration  and  pH,  52-54 
diffusion  constant,  52 
disulphide  bonds,  45,  62 
immunologically   active  fragment 
of,  290-294 
amino  acid  composition,  292 
dimerization,  291-293 
physico-chemical        properties, 

291-292 
stabiUty,  293 
polymerization,  62 
sedimentation  constant,  52 
tyrosine  ionization  in,  53 
viscosity,  38,  52 
Serum  proteins 
in  amphibian  metamorphosis,  296- 

297 
as  iso-antigens,  298-301 
Subtilisin 
action    on   DNP-tobacco   mosaic 
virus  protein,  268 
ribonuclease,  229,  231-233,  236 
Sulphydryl  groups 
in  haemoglobin,  136 
in  papain,  184-186,  202-206 


reaction  with  p-chloromercuriben- 
zoate,  130,  136,  184-186,205 

iodine,  252 

iodoacetamide,  185,  186,  316 

iodoacetate,  224-232 

methyl  mercury,  252 
in  serum  albumin,  292-293 
in  tobacco  mosaic  virus,  251-252 

Titration  curves,  see  Ionization 
Tobacco  mosaic  virus 
hydroxylamine,  reaction  with,  266 
metal  ion  binding  by,  255-256 
peptides  from  proteolysis  of,  256- 

260,  267-269 
protein    subunits,    249-251,    263, 
271-273 
bonding  of,  253-256 
molecular  weight   of,   250-251, 
267-269 
stability,  250-251,  256,  263 
strains,   differences   between,   73, 

247,  262-263 
sulphydryl  groups,  251-252 
trichloroacetic  acid  on,  266,  269 
X-ray  analysis,  271-273 
Trypsin 
action  on  carbobenzoxylated  pro- 
teins, 224,  227-228 
chymotrypsinogen,         155-159, 

162-163 
corticotropin,  310-311 
enolase,  338-339 
lysozyme,  227-228,  280-285,  289 
melanotropin,  305,  333-334 
papain,  oxidized,  188-192 
ribonuclease,  211-221,  227 
tobacco  mosaic  virus,  257 
trypsinogen,  169-173 
dialysis  rate,  108,  110 
reaction   with    organophosphorus 
esters, 
effect  of  acetylation  on,  177 
effect  of  urea  on,  176-178 
specificity  of,  288-289 
C-terminal  configuration,  171 
Trypsinogen 
activation  of,  169-174 
acetyl  derivatives  of,  171 
effect  of  urea  on,  173 
amino  acid  composition,  81 
basic  peptides  from,  85-87 
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Trypsinogen — cont. 
cysteic  acid  peptides  from,  84 
dialysis  rate,  108 
Tryptophan  in  proteins,  oxidation  of, 

224-227,  283,  285 
Tyrosine  in  proteins 
hydrogen  bonding  of,  41,  42,  179, 

236-238,  252-255 
ionization  of,  43,  50,  53,  55,  57,  59, 
236,  252 

Ultra-violet  absorption  spectra 
myoglobin,  146-147 
ribonuclease,  59 
effect  of  chemical  modification 
on,  233-238 
tobacco  mosaic  virus,  252-253 
Urea 
effect  on  dialysis  rate  of  proteins, 
110 
enolase  activity,  339 
pepsin  autolysis,  179-181 
ribonuclease  activity,  233-235 


serum  albumin,  62 
trypsinogen  activation,  173 
hydrogen  bonding  of,  in  solution, 
41 


Vasopressin,  species  differences,  73 
Viscosity 

of  polyelectrolytes,  48 

of  proteins,  37-39,  52-60 

Water,  hydrogen  bonding  and  protein 
configuration,  39,  41 

X-ray  analysis 
haemoglobin,  136-139 
myoglobin,  125-135 
isomorphous    replacement 

method,  126-132 
three-dimensional  Fourier  syn- 
thesis, 132-135 
tobacco  mosaic  virus,  255,  271-273 
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